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Abstract 


Principles  for  the  selection  of  a  finite  element  method  for  a  particular 
problem  are  discussed.  These  principles  are  stated  in  terms  of  the  notion  of 
approximability,  optimality,  and  stability.  Several  examples  are  discussed 
in  detail  as  illustrations.  Conclusions  regarding  the  selection  of  finite 
element  methods  are  summarized  in  the  final  section  of  the  paper. 


1.  Introduction 


Larger  and  larger  classes  of  finite  element  methods  are  becoming 
available  for  the  approximate  solution  of  engineering  problems  and  the 
selection  of  a  method  for  a  particular  problem  is  an  increasingly  important 
question.  It  is  the  purpose  of  this  paper  to  discuss  some  principles  for 
the  selection  of  finite  element  methods. 

A  finite  element  method  or,  more  generally,  a  variational  method  is 
a  discretization  of  a  variational  (weak)  formulation  of  the  problem  under 
consideration.  More  specifically,  it  consists  of  several  items. 

1)  Selection  of  a  variational  (weak)  formulation  of  the  original 
problem.  There  are,  in  fact,  many  such  formulations  and  their 
choice  can  significantly  affect  the  resulting  finite  element 
method.  The  choice  of  a  variational  formulation  leads  to  the 
choice  of  a  bilinear  form. 

2)  Selection  of  a  trial  space.  The  trial  space  consists  of  those 
elements  (shape  functions)  with  which  the  solution  will  be 

be  approximated.  It  is  thus  chosen  so  as  to  provide  good 
approximation  properties.  The  choice  of  trial  space  depends, 
of  course,  on  the  set  of  possible  exact  solutions  under  consideration. 

3)  Selection  of  a  test  space.  This  space  is  chosen  so  that  the 
approximate  solution  is  easily  computed  and  so  that  the  error 

is  comparable  with  the  error  in  the  best  possible  approximation 
achievable  by  elements  in  the  trial  space. 

4)  Selection  of  the  norm.  The  selection  of  the  norm  relates  to  the 
measure  of  acceptability  of  the  approximate  solution  and  thus 
depends  on  the  goals  of  the  computation. 

5)  Selection  of  the  extension  procedure.  This  procedure  describes  the 
manner  in  which  the  trial  and  test  spaces  (and  possibly  the 


variational  formulation  are  changed)  when  the  desired  accuracy  is  not 
achieved  and  the  approximate  solution  has  to  be  improved. 

The  selection  of  a  method  for  a  specific  problem  depends  on  the  goals 
of  the  computation,  implementational  questions,  and  other  practical  circum¬ 
stances.  Selection  and  comparison  of  methods  is  not  a  simple  task  and  in 
order  to  find  "optimal"  methods  it  is  important  to  clarify  as  much  as 
possible  the  basic  notion  of  a  variational  method  and  criteria  by  which 
different  methods  can  be  compared.  It  is  necessary  to  emphasize  that  selection 
of  methods  depends  on  many  factors  and  will  always  have  a  relative  character. 

The  influence  of  computer  technology  on  the  selection  process  could  be 
especially  important. 

Let  us  turn  now  to  a  brief  outline  of  the  paper.  The  paper  is  partially 
expository  in  nature,  with  the  mathematical  results  having  primarily  an  illustra¬ 
tive  as  opposed  to  a  practical  importance. 

Section  2  deals  with  the  principle  ideas  of  a  variational  method. 

Subsection  2.1  introduces  the  notions  of  simple  variational,  variational, 
directed  variational,  and  computational  variational  methods,  concepts 
we  view  as  important  for  the  discussion  of  the  multiplicity 

of  methods  considered  today  in  theory  and  practice.  Subsection  2.2  discusses 
approximability  and  optimality.  Approximability  refers  to  the  quality  of  best 
approximation  achievable  by  the  trial  space  (see  2)  above)  and  optimality 
refers  to  the  comparison  of  the  approximation  yielded  by  the  finite  element 
solution  and  the  best  possible  approximation  achievable  by  elements  in  the 
trial  space  (see  3)above).  In  Subsection  2.3  optimality  is  elaborated  on  and 
related  to  stability.  We  introduce  the  stability  constant  which  is  often 
relatively  easy  to  estimate  and  in  terras  of  which  one  can  estimate  the  optimality 
constant.  The  ideas  introduced  in  Section  2  are  illustrated  by  a  series  of  exampl 


In  Section  3  we  consider  a  further  example  of  a  finite  element  methods 


which  illustrates  several  of  the  ideas  introduced  in  Section  2. 

In  Section  4  we  summarize  those  conclusions  regarding  the  selection 
of  finite  element  methods  that  can  be  drawn  from  the  discussion  in 
Sections  2  and  3. 

Throughout  the  paper  we  will  use  certain  function  spaces.  For  an 
interval  I  =  (a, 6)  , 


(u 


L  (I)  =  K 

P  P 


u  bounded}  ,  p  =  <*>  . 


On  L  (I)  we  use  the  norm 
P 


-  < 


((  |u|Pdx)1/p  ,  1  $  p 


<  au 


ess  sup|u(x) 
V.  x€I 


P  = 


Ic  k 

H  Cl)  =  H  is  the  usual  Sobolev  space  of  functions  whose  first  k  derivatives 
are  in  L^CT)  .  On  this  space  we  use  the  norm 


I  \r 

HK(I) 


Ij] 


1/2 


■L2(I) ) 


,[j]  =  d^u 

dxj 


By  C  (I)  we  denote  the  space  of  functions  whose  first  ('  derivatives  are 
continuous  on  I  =  (a, (3]  with  the  norm 


I 


I  P  — 
Cl  I) 


1  max|u^^(x)| 

j=0  x€ T 


(L  )  will  denote  the  spaces  of  pairs  of  functions  in  L 


2 .  Variational  Methods 
2.1.  Fundamental  Ideas 

Throughout  this  paper  we  suppose  we  are  interested  in  approximating 

the  unique  solution  u^  of  some  problem  by  a  variational  method  of 

discretization.  (It  is  not  necessary  to  give  a  precise  statement  of  the 

problem  here.)  In  this  section  we  shall  formulate  some  important  ideas 

which  will  allow  us  to  discuss  variational  methods  of  discretization. 

A  simple  variational  method  is  specified  by  a  linear  vector  space  H, 

a  finite  dimensional  subspace  S  c  H  called  the  trial  space,  a  second 

finite  dimensional  space  V  of  the  same  dimension  as  S  called  the  test 

space,  and  a  bilinear  form  B  defined  as  H  x  V  •  We  assume  that  B(s,v) 

is  regular  on  S  *  V  ,  i.e.,  we  assume  that  for  every  0  /  s  €  S  there  is 

a  v  €  V  such  that  B(s,v)  ^  0  .  This  condition  is  referred  to  as 

N  N 

regularity  since,  if  and  are  bases  for  S  and  V  , 

respectively,  then  B(s,v)  is  regular  if  and  only  if  the  matrix 
is  regular  (or  invertible) .  We  will  denote  the  simple  variational  method 
by  the  four-tuple  M  =  (H, S,V,B). 

M  is  used  to  determine  an  approximate  solution  Uq(M)  €  S  .called 
the  M-approximate  solution,  to  the  exact  solution  Uq  ,  which  is  assumed  to 
lie  in  H  ,  by  requiring  that 

(2.1.1)  B(Uq(M) ,v)  =  B(uQ,v) 

holds  for  all  v  €  V  .  Using  the  bases  {cp  }  rnd  {\K  }  and  writing 
N  1  1 

u_(M)  =  £c.<p,  ,  we  see  that  (2.1.1)  is  equivalent  to  the  system  of  equations 
0  1  J  J 

N 

l  B((p^.,fi)cJi  =  B(uQ,ti)  ,  i  =  1,...,N  . 


Since  the  matrix  of  this  system  is  regular,  the  coefficients  c.  and  hence 
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the  approximate  solution  Uq(M)  is  uniquely  determined.  In  this  way  we 
associate  to  each  u  C  H  the  M-approximate  solution  u(M)  which  is  also 
denoted  by  Pu  or  P(M)u  .  Note  that  Ps  =  s  for  any  s  €  S  and  there¬ 
fore  that  P  is  a  projection.  We  thus  see  that  to  any  simple  variational 
method  we  uniquely  associate  the  projection  P  =  P(M)  of  H  onto  S  . 

Often  we  will  write  M(S,V)  instead  of  M  to  underline  the  dependence 
S  and  V  ,  especially  if  the  H  and  B  under  consideration  are  clear 
from  the  context.  Likewise,  instead  of  P(M)  and  u(M)  we  write  P(S,V) 
and  u(S,V)  ,  respectively. 

The  space  H  must  be  known,  a  priori,  to  contain  the  exact  solution 
Ug  .  Furthermore,  the  bilinear  form  B  must  be  such  that  B(Ug,v)  can 
be  computable  from  the  data  that  determines  the  exact  solution  u^  ,  without 
knowing  u^  explicitly.  Let  us  note  that  in  this  section  we  are  not 
discussing  the  questions  of  how  well  the  M-approximate  solution  Uq(M) 
approximates  the  exact  solution  u^  .  This  is  addressed  in  the  next  section. 
Since  the  exact  solution  could,  a  priori,  be  any  element  in  H  we  will  often 
denote  it  by  u  . 

We  now  formulate  some  typical  examples;  these  will  be  elaborated  on  in 
the  remaining  sections. 

Example  2.1.1. 

Let 


(2.1.2) 


an(x)  a12(x) 


a21(x)  a92 (x) 


be  a  matrix  defined  in  I  =  (-tt,tt)  ,  where  a..  CL  (I)  ,  and  A  is 

1J  00 

regular  (invertible)  with 


cu(x)  C  (X) 


c2i (x)  c22(x) 
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.  -v-y  y- -y.yy\\-.y.y •-V-.V- ‘~\y.‘>y --Xv^n.sv. 


where  c^  €  1.^(1)  .  We  then  consider  the  problem  of  finding 
u(x)  =  (uI1](x),ul21(x))T  such  that 


(2.1.3) 


A(x)u(x)  =  f (x) 


for  a  given  f^(x)  =  (f  ^  (x)  ,f  ^  (x) )^  ,  where  _f  €  (I^)^  . 

A  simple  variational  method  M  =  (H,S,V,B)  is  determined  by  the 
choices 


h  =  a2(i)r  , 


S  =  V  =  {(s[11,sl2]):sIkl  =  l  cjk]sin  jx  +  £  discos  jx  , 


L  1 
j=l  3 


j=0  J 


and  dj  J  real,  k  =  1,2}  , 


(2.1.4) 


fTf  „ 

T 

(u,v)  =  v  Au 
J-TT 


S  and  V  have  the  same  dimension,  namely  N  =  4n  +  2  .  We  will  later 

show  that  B  is  regular  on  S  x  V  under  certain  additional  assumptions. 

2  2 
For  any  £  €  (L^I))  ,  the  exact  solution  _u  of  (2.1.3)  lies  in  (L^d)) 

and  B(u,v)  is  computable  in  terms  of  _f  without  knowing  u  since 


we  have 


fn  T 

(u,v)  =  I  v  i_  dx  . 

'  -Tf 


2  2 
(We  note  that  we  could  also  take  H  =  (L^(I))  and  allow  f_  €  (L^(I))  , 

which  by  our  definition  would  mean  a  different  simple  variational  method.) 

Example  2.1,2 

Consider  the  problem  of  determining  u(x)  so  that 


(2.1.5) 


(-  u"(x)  =  f (x)  ,  x  e  i  =  (-1,1) 

\  u(-l)  =  u(l)  =  0 
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where  f  t  I^CI) 
a)  Let 


H  =  H1  =  H1  =  {u(x):u  (  H1(I)  ,  u(-l)  =  u(l)  =  0}  , 


S  =  V  =  (s  is  a  polynomial  of  degree  n,  s(-l)  =  s(l)  =  0} 


For  f  €  ,  the  exact  solution  u  of  (2.1.5)  lies  in  H 

The  dimension  of  S  and  V  is  n  -  1  .  Finally  we  take 


!(u,v)  =  -  (  uv"dx  . 
J-1 


It  is  easy  to  see  that  B  is  bilinear  on  *  V  ,  is  regular  on  S  *  V  , 


and  that 


B(u,v)  =  fv  dx 
J-1 


for  any  v  €  V  ,  where  u  is  the  exact  solution  of  (2.1.5). 

b)  We  can  also  take  H  =  =  L2(I)  .  B(u,v)  is  still  defined  and 

bilinear  in  x  V  .  This  choice  for  H  is  important  if  f  is,  for  example, 
a  dipole,  i.e.  ,  the  derivative  of  the  Dirac  function.  In  this  situation 
u  will  be  in  H ^  but  not  in  H  .  Thus  the  choice  H  =  allows  us  to 
treat  (2.1.5)  with  f  a  dipole.  Note  that  computationally  the  methods 
are  identical,  but  according  to  our  definition  we  are  dealing  with  two 
different  simple  variational  methods,  namely  (H^,S,V,B)  and  (H^, S,V,B). 

Example  2.1.3 

Consider  the  problem  of  finding  u(x)  such  that 


(2.1.6) 


(x)  =  f(x)  ,  x  e  i  =  (o,i) 
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Let  A 


{0  =  x„  <  x,  <  •  •  •<  x  =1}  ,  where  x,  =  j/n  ,  be 

0  1  n  J 


a  uniform  mesh 


on  I  and  set  I.  =  (x.  ,  ,x.)  ,  h.  =  x.  -  x.  ,  =  1/n  ,  and  h  =  1/n  . 

J  j-1  j  j  J  j-1 

Then  let 

H  =  H1  =  {u  :  u  €  H1(I)  ,  u(0)  =  0}  , 


S  =  =  (s(x)  :  s  is  continuous  on  I  ,  s  is  linear  in  each  1.  , 

s(0)  =  0  .} 


For  the  test  spaces  we  will  consider  two  choices: 

a)  V  =  n  =  {v(x)  :  v(x)  is  continuous  on  I  , 

v  is  linear  on  each  1^  ,  v(l)  =  0}  . 

b)  V  =  V„  =  V„  {v(x)  :  v(x)  is  constant  on  each  I.}  . 

Z  Z ,  n  j 

In  both  cases  we  consider  the  bilinear  form 


B(u,v)  -  u'v  dx  . 

I 

It  is  easy  to  see  that  B(u,v)  is  regular  on  S  x  V  in  both  cases. 

If  f  €  L2(I),  then  the  solution  u  of  (2.1.6)  belongs  to  H  and  for 
any  v  €  V  , 

B(u,v)  =  I  fv  dx  . 

I I 


It  often  occurs  that  the  approximate  solution  produced  by  a  simple 
variational  method  is  judged  to  be  insufficiently  accurate  and  then 

another  method  is  chosen  which  will  give  better  accuracy.  If  necessary, 

i- 

a  third  method  is  chosen,  and  so  forth.  Although  only  finitely  many 

computations  can  be  performed  in  practice,  if  a  procedure  is  given  for 
determining  the  new  simple  method  from  the  previous  ones,  one  is 

lead  to  an  infinite  sequence  or  familv  of  simple  variational 

methods,  chosen  to  produce  an  approximate  solution  with  acceptable  accuracy 
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by  considering  sufficiently  many  of  the  simple  methods  in  the  family.  The 

methods  M  are  often  chosen  so  as  to  share  the  same  bilinear  form  B  . 
n 

We  thus  suppose  we  are  given  a  pair  of  vector  spaces  H  and  (/  ,  a 

bilinear  form  B  on  H  *  V  ,  and  consider  a  family  F  of  pairs  (S,V)  of 
subspaces  Sc  H  and  V  c  \J  so  that  dim  S  =  dim  V  <  »  and  B  is 
regular  on  S  *  V  .  The  family  of  simple  methods  M(S,V)  =  (H,S,V,B)  for 
(S,V)  £  F  will  be  denoted  by  M  =  (H,l/,F,B)  and  will  be  called  a 
variational  method.  A  simple  method  M  €  (H,l/,F,b)  is  completely 
characterized  by  the  pair  (S,V)  £  F  . 

In  connection  with  such  families  of  simple  methods  it  is  useful  to  speak 
of  error  (absolute  and  relative)  convergence,  and  other  asymptotic  concepts, 
because  the  final  aim  is  to  obtain  an  M-approximat e  solution  which  approx¬ 
imates  the  exact  solution  sufficiently  well.  Suppose  X  H  is  a  Banach 
space  with  norm  l*lx’  we  define  the  absolute  error  in  the  approximate  solution 

to  be  |u-u(S,V)|  .  Let  a  be  a  function  associating  to  every  pair  (S,V)  £  F 

X 

a  real  positive  number  a(S,V)  ;  a(s,v)  will  be  called  the  discretization 
parameter  associated  with  the  pair  (S,V)  .  We  say  u(S,V)  converges  to 
u  in  X  as  a(S,V)  -+  0  ,  written 

lim  u(S,V)  =  u  , 
a(S,V)-K) 

if  for  each  t  there  is  a  5  >  0  such  that 

|u(S,V)-u|  <  r 

for  any  (S,V)  €  F  satisfying  a(S,V)  <  5  .  We  will  often  now  consider  H 
to  be  equipped  with  the  norm  |  •  |  .  The  family  (H,X,l/,F,B)  together 

with  the  discretization  parameter  a  will  be  called  a  directed  variational 


method.  For  such  directed  variational  methods  we  will  use  the  notation 
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M  =  (H,X,V,F,B,a)  .  The  use  of  M  for  both  the  family  of  simple  methods 

and  the  directed  method  will  not  cause  confusion. 

When  implementing  a  variation  method  we  select  a  sequence  (S  ,V  )  €  F  , 

n  n 

defining  the  simple  methods  =  MCS^.V^)  ,  and  compute  the  M^approximate 

solutions  u^  =  u(S^,V^)  with  increasing  n  until  an  acceptable  result 

has  been  attained.  The  process  of  selecting  (S  V  )  from  (S.  ,V.)  , 

n+1  n+1  i  x 

i  =  l,...,n,  and  possibly  from  u(Sn>Vn)  anc*  other  available  information 
will  be  called  an  extension  procedure.  The  sequence  {Mn }  will  be  called 
a  computational  variational  method.  Usually  our  acceptance  criterion  will 
be  quantified  in  terms  of  a  norm,  as  described  above. 

Let  us  return  now  to  our  examples.  We  will  elaborate  in  the  examples 
introduced  earlier. 

Example  2.1.1* 

Let  F  be  the  family  of  pairs  (S,V)  ,  where  S  =  V  is  the  space  of  trigno- 

2 

metric  polynomials  of  arbitrary  degree  n.  Further  we  select  V  =  H  =  (L^Cl))  . 

By  this  selection  we  have  characterized  a  variational  method  M  =  (H,  l/,F,B)  . 

Choosing  X  =  L„(I)  ,  |«|  =  |*|  and  a(S  ,V)  =  —  .where  n  is  the 

^  L2  ^ 
degree  of  the  polynomials  in  S  (dim  S  =  4n+2)  ,  we  define  the  directed 

variational  method  M  =  (H,X, V,F,B,a)  . 

Selecting  (S^.V^)  ^  F  ,  =  Vn  being  the  space  of  trigonometric  polynomial 

of  degree  n  ,  the  extension  procedure  consists  of  increasing  the  polynomial  degree 

by  one.  The  sequence  of  simple  methods  {M  }  ,  M  =  (H, S  ,V  ,B),  is  a 

n  n  n  n 

computational  variational  method.  A  different  extension  procedure  and  computa¬ 
tional  method  will  be  achieved  if,  for  example,  the  degree  of  the  polynomials 
is  increased  in  different  way,  say  by  2. 

Example  2.1.2* 

Quite  analogously  as  in  the  first  example  we  select  for  F  the  family 

of  pairs  (S  ,V  )  ,  S  =  V 
n  n  n  n 


being  the  space  of  the  algebraic  polynomials  of 


11 


[ 

i 

i 

►-I 

s 


o 

degree  n  with  zero  values  at  x  -  0  ,  x  =  1  .  In  the  case  a)  we  choose  1/  =  H 

X  =  H^"  and  in  the  case  b)  U  =  X  =  L„(I)  .  We  further  choose  a(S  ,V  )  =  — 

*2  n  n  n 

where  n  -  1  =  dim  .  The  extension  procedure  would  consist  now  in  some  specif 
manner  of  increasing  the  degree  of  the  polynomials.  The  notions  of  a  variational] 
directed  variational,  and  computational  variational  method  are  now  obvious. 
Example  2.1.3* 

In  the  case  a)  we  obviously  select  for  V  the  space  of  all 
continuous,  piecewise  linear  functions  subordinate  to  any  uniform 
mesh  which  vanish  at  1  .  In  the  case  b)  we  let  1/  be  the  space 

of  all  piecewise  constant  functions  subordinate  to  any  uniform  mesh.  VJe  choose 
the  discretization  parameter  a(S,V)  =  h  =  ^  and  in  both  cases  we 
consider  X  =  H  .  Note  that  in  examples  2.1.1  and  2.1.2  we  have  S  c  S  , 

V  c  V  whenever  a(S,V)  >  a(S,V)  but  that  this  is  not  the  case  in  Example 
2.1.3*. 


In  Example  2.1.3,  instead  of  X  =  H  we  could  select  X  =  , 

etc.  We  could  also  choose  H  =  fl  ^  ,  k  5  2  ,  instead  of  H  =  . 


2.2.  Approximability  and  Optimality 

Consider  a  simple  variational  method  M  =  (H,S,V,B)€M  ,  where  M=  (h,X,l/,F,B,ct 
is  a  directed  variational  method.  The  purpose-  of  a  variational  method  is  to 
obtain  an  M-approximate  solution  u(M)  such  that  the  error  (say  relative  Is 
smaller  than  a  given  tolerance  r  .  This  means  that  we  wish  to  select  M  so  that 

(2.2.1)  | u(M)  -  u|x  2  t!u|x 


where  u  is  the  exact  solution  of  our  problem.  Thus  for  a  given  problem 
the  fun  amental  goal  is  as  follows:  given  X  and  t  ,  we  wish  to  choose 


M  =  (H,S,V,B)  €  M  so  that  (2.2.1)  is  satisifed  in  the  most  effective  way. 
The  word  "effective"  must  of  course  be  understood  in  a  manner  appropriate 
to  the  particular  situation. 


To  achieve  (2.2.1)  it  is  certainly  necessary  that 

inf  |u-s|x 

(2.2.2)  Z  (u,  S ,  X)  =  — — 

u 

The  quantity  Z(u,S,X)  measures  the  relative  error  in  the  best  possible 
approximation  of  u  by  elements  of  S  with  respect  to  the  chosen  norm 
| ‘|  ,  i.e.,  it  measures  the  approximability  of  u  by  S  with  respect  to 

X  and  is  called  the  approximability  constant  of  S  on  u  with  respect  to  X  or, 
more  briefly,  the  approximability  constant. 

That  the  trial  functions  are  able  to  approximate  the  solution  well, 
i.e.,  that  Z(u,S,X)  is  small,  does  not  alone  insure  that  the  M-approximate 
solution  u(M)  is  close  to  the  exact  solution  u  .  We  therefore  introduce 
the  ratio  of  the  relative  error  in  u(M)  to  the  relative  error  in  the  best 
approximation.  For  any  u  €  H  ,  with  |u|x  >  0  define  C(u,M,X)  5  1  by 

|u(M)  -  u| 

(2.2.3)  -  =  C(u,M,X)  Z ( u , S , X)  . 

If  Z(u,S,X)  =  0  we  set  C(u,M,X)  =  1  .  The  quantity  C(u,M,X)  measures  the 

optimality  of  the  approximate  solution  chosen  by  M  and  is  called  the  optimal lty 
con stant  of  M  on  u  with  respect  to  X  or,  more  briefly,  the  optimality 
constant .  When  C(u,M,X)  is  near  1  the  approximate  soluton  u(M)  is  nearly 
as  good  as  the  best  possible  approximation  using  the  trial  space  S  .  We 
emphasize  that  while  Z(u,S,X)  is  independent  of  the  form  B  and  the  test 
space  V  ,  the  optimality  constant  C(u,M,X)  depends  on  S,V,B,X,  and  u  . 


The  acceptance  criterion  (2.2.1)  is  thus  simply  that  the  product  of 
C(u,M,X)  and  Z(u,S,X)  does  not  exceed  t  . 
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Although  the  exact  solution  u  is  unknown  in  any  practical  problem, 
we  often  know  some  properties  of  u  ,  namely  that  it  belongs  to  H  or  to 
H  ,  where  H  is  a  subset  of  H  .  We  define  the  approximability  constant 
of  the  space  S  on  H  with  respect  to  X  as  the  number 

Z(H, S , X)  =  sup  Z(u,S,X) 
u€H 

and  the  optimality  constant  of  M  on  H  with  respect  to  X  as 

C(H,M,X)  =  sup  C(u,M,X)  . 
u€H 

We  extend  the  notion  of  optimality  constant  to  the  directed  variational 
method  M  =  (H,  X,  l/,  F ,  B ,  a)  by  defining 

C(H,M,X)  =  sup  C (H,M, X) 

M€M 

-  C  (H,M(S,V),X)  . 

(S,V) (F 

The  directed  variational  method  M  is  calle^  quasi-opt imal  on  H  with 
respect  to  X  if  C(H,M,X)  is  finite. 

The  set  H  c  H  can  be  arbitrary.  E.g.  in  the  Example  2.1.2  (2.1.2*) 

ol 

in  case  a)  we  have  chosen  H  =  H  but  we  could  select  for  example 
H  =  H  ,  k  ■  1  .  We  also  can  select 

H  =  {u€H^(I)  0  :  u"(x)  >  a|uj  on  (<o,r))  ,  where  |co-r)|  =  a,? }  ,  >  0  , 


etc.  Note 

that 

Z  and  C 

are  homogeneous 

with  respect  to  u 

i .i  .  X (on , S , X) 

=  Z  (u ,  S ,  X) 

anti 

C(cu,M,X)  = 

C(u,M,  X)  and  so 

we  can  rest  riot  our 

interest 
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only  to  functions  u  lying  on  the  unit  sphere. 

The  main  use  of  the  optimality  constant  C(u,M,X)  and  C(H,M,X)  is 

to  provide  an  estimate  for  the  absolute  error  |u-u(M)|^  in  terms  of  the 

error  of  best  approximation  inf  |u-s|„  .  From  (2.2.3)  we  have 

s€S 

| u-u(M) | x  <  C(u,M,X)Z(u,S,X) |u|x 

=  C(u,M,X)  inf  |u-s| 
s€S 

which  holds  for  any  u  €  H  .  For  any  u  €  H  we  have 

| u-u(M) | y  5  C(H,M,X)  inf  |u-s|Y  . 

X  s€S  X 

As  we  have  seen  above,  the  effectiveness  of  a  simple  variational  method 
is  influenced  by  two  factors:  approximability ,  which  depends  on  the  selec¬ 
tion  of  the  trial  space  S  ,  and  optimality,  which  depends  on  the  selection  of 
S  ,  V  ,  and  B  .  Give  X  ,  we  thus  want  to  choose  M  so  that  C(u,M,X)  and 
Z(u,S,X)  are  as  small  as  possible.  Let  us  now  turn  to  a  more  detailed  dis¬ 
cussion  of  C(u,M,X)  .  As  indicated  above  we  want  C(u,M,X)  to  be  not 
much  large  than  1  .  Now,  it  is  known  that  C(u,M,X)  can  depend  strongly 
on  u  (this  will  be  illustrated  later  by  examples) .  In  order  to  make 
precise  and  quantify  this  notion  we  say  that  the  solution  u  is  K-perf ect 
or  K-perfect  with  respect  to  the  method  M  if  C(u,M,X)  <  K  .  We  will  say 
u  is  perfect  if  it  is  K-perfect  with  a  small  K  otherwise  we  will  call  it 
imperfect.  A  computational  variational  method  {M..  }  will  he  said  to  be  }- 
perfect  if  C(u,M^,X)  S  .  In  practice  we  would  usually  want  {K^}  to  be 
hounded  by  a  fairly  small  number  or  to  be  increasing  only 
slowly.  We  note  that  even  if  C(u,M^,X)  -*■  °°  ,  u(M^)  may  still  converge 


to  u,  for  this  particular  u,  since  the  product  C(u,M^,X)Z(u,S^,X) 
may  still  approach  zero. 

Since  the  method  M  is  defined  for  all  u  €  H  ,  for  a  given  K  it  is 
natural  to  attempt  to  characterize  the  set  H  of  all  u  €  H  for  which 

C(u,M,X)  5  K  , 

i.e.,  to  haracterize  the  largest  set  H  satisfying 

C(H,M,X)  5  K  . 

Usually  it  is  not  easy  to  describe  the  set  in  such  a  way  that  one  could 
in  practice  decide  whether  the  exact  solution  belongs  to  it,  except  when 
H  =  H  =  X  . 

The  observations  that  C(u,M,X)  may  depend  strongly  on  u  is  very 
important.  It  is  common  practice  to  attempt  to  draw  conclusions  about  the 
performance  of  a  method  M  from  experimental  computations.  These  conclusions 
could  be  misleading  since  the  computations  may  have  been  made  for  a  solution 
u  which  is  K-perfect  with  a  small  K  ,  and  for  which  Z(u,S,X)  is  small. 

These  conclusions  could  then  be  false  when  some  other  solutions  are 
considered . 

This  situation  is  related  to  the  notion  of  robustness .  A  simple  method 
M  or  a  directed  method  M  is  said  to  be  robust  if  it  performs  well  in 
relatively  general  circumstances.  It  may  be  that  a  less  robust  method 
performs  better  in  certain  situations.  There  are,  in  fact,  methods  which 
perform  well  in  certain  situations  but  which  fail  to  converge  in  others; 
this  is  the  extreme  in  nonrobustness.  The  importance  of  robustness  has 
to  be  seen  in  connection  with  the  obse- vat  ions  that  computational  cost  is 
becoming  a  smaller  and  smaller  part  of  the  total  cost  of  engineering  analysis. 
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In  summary,  as  much  as  possible  we  would  like  to  have  a  directed  method 
M  which  is  quasi-optimal  on  H  -  H  (usually  H  =  X)  with  the  constant 
C(H,M,X)  not  too  large. 

Next  we  make  some  simple  observations  connecting  approximability , 
optimality,  and  convergence,  and  rate  of  concergence  of  a  directed  varia¬ 
tional  method.  A  directed  variational  method  is  called  convergent  on  H  if 

(2.2.4)  lim  |u-u(S,V)|  =0 

a(S,  V)-K) 

for  all  u  €  H  .  If  r  is  a  function  defined  on  F  satisfying 

lim  r(X,V)  =  0  ,  then  the  method  is  convergent  with  rate  r  on  H  if 

a(S, V)  K)  ”  '  ~ . 

| u-u(S,V) |x 

(2.2.5)  sup  sup  - <  «°  . 

u€H  (S,V) €  F  r (S,V) 


We  can  also  define  the  rate  of  convergence  of  a  computational 
variational  method  {M^}  .  {M^}  is  said  to  converge  with  rate  r  if 


(2.2.6) 


sup 

u€H 


sup 

i 


u-u(S1,V1) |x 


r(Si"V 


ao 


We  note  that  if  a  directed  variational  method  is  quasi-optimal,  then  (2.2.4), 
(2.2.5),  or  (2.2.6),  respectively,  holds  if  and  only  if 

lim  Z(u,S,X)  =  0  , 
a(S  ,VH 

Z(u,S,X)|u| 

sup  sup _ 'X  m 

u^H  (S,V)€F  r(S,V) 


or 


sup  sup 
u€H  i 


Z(u,S1,X)|u| 

r(S.  ,v7) 

l  i 
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respectively,  hold. 

We  now  return  to  our  examples. 

Example  2,2.1. 

We  consider  the  problem  introduced  in  Examples  2.1.1  and  2.1.1*  with 
the  matrix  A(x)  chosen  as 

(X-cos  x  sin  x 
-sin  x  X-cos  x 

with  X  real,  X  ^  +1,0  .  In  the  discussion  of  this  problem  it  will  be 
convenient  to  use  complex  notation  rather  than  vector  notation.  Thus 
regarding  _u  and  as  complex  valued  functions  we  can  write 

Au  =  f  as 


(2.2.7)  (X-e1X)u  =  f 

where  here  we  are  using  Complex  multiplication.  For  test  and  trial  space 
we  now  take 


j=-n 


c  .e 
3 


c . 
3 


complex}  . 


The  bilinear  form  will  now  be 


B(u,v) 


f- 

■'-TT 


u(X-e 


IX 


)v  dx  . 


It  is  easy  to  see  that  the  bilinear  form  is  regular  on  S  x  V  under  the 
assumptions  on  X  imposed  above.  We  let  H  =  V  =  L2(I)  and  X  =  L2(I)  , 
where  here  L^CI)  is  the  complex  space,  i.e.,  the  space  of  square 


integrable  complex  valued  functions  with  the  norm 
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If  X  4  0  ,  this  system  is  uniquely  solvable  and  we  obtain 


(2.2.11) 


'-n+j 


*  -  X 


•i-J 


>  j  0,1, . . .  ,2n 


It  follows  from  (2.2.9)  and  (2.2.10)  that 


Z(„,S  X)2  -  2n<  l  |c  I2)|u|"2 

“  lil>n  2 


and  from  (2.2.9)  -  (2.2.11)  that 


2h 

I  u-u  h  =  |c  1l  x  2 n  l  X  +  Z(u,S  X)2ju 
n  X  -n-i  j  _q  n 


2 

X 


I  1 2-2  1-X 

=  c  .  X  2n  - 

'  -n-1 1 


-An-2 


1-X 


_2  +  Z(u,S  ,X)2|u|J 


i 


and  therefore  that 


1 


Ic.^1 


-An-2 


(2.2.12) 


C~(u,M  ,X)  =  1  + 
n 


1-X 


-2 


hi 


l  lc 
>n 


We  vd.ll  now  analyze  separately  the  cases  |\|  >  1  and  |\|  <  1 


a)  First  we  assume  |\|  >1  .  Then  from  (2.2.12)  we  see  that 


C(u,M  ,X)  S  K  =  (1  + 

n  \  X2-l 


1/2 


for  all  u  €  L2(I)  and  for  all  n  ,  and  thus 


C(H,M,X)  5  K 


The  method  is  therefore  quasi-opt imal  on  H  with  respect  to  X  and  all 


solutions  are  perfect. 
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b)  Now  assume  |\  <  1  .  Then  we  see  that 


C2(H,M  ,X)  =  1  +  X  2  — - 
n 


-An-2 


1-X 


-2 


and  the  optimality  constant  deteriorates  very  quietly  with  n  and  we  have 

C(H,M,X)  =  +  “  . 

Let  now  c  H  ,  £  =  1,2,  be  defined  by 

2  2  2 

=  {u  €  L2(I) :  |c_n_^  |  >  a  J  |c  |  for  all  n}  ,  0  <  a  <  1 

|j|>n  J 

and 

H2  =  {u  €  L2(l):c_k_1  =  0  ,  k  >  0} 


where 


obviously 


are  coefficients  of  the  exact  solution  in  (2.2.9). 


Then 


C(u,M(Sn,Vn)  ,  X)  ?  X  2n  Xa 


for  all  u  f  and  we  see  that  all  solutions  belonging  to  are  very 

imperfect  for  large  n  .  On  the  other  hand  it  is  easy  to  check  that  all 
solutions  in  H2  are  perfect  and  M  is  quasi-optimal  on  H2  (C(H2,M,X)  =  1 
Assume  now  that  the  extension  procedure  is  such  that  only  ^2k’^2k^’  ^  1 

are  used.  If 


H3  =  {u  €  L2(I)  :  c_£  =  0  ,  i  >  1  odd} 

then  the  method  is  once  more  quasi-optimal  on  .  (If  another  extension 

procedure  would  he  used  then  these  solutions  could  he  very  imperfect.) 

Consider  now  the  subset  of  H.  of  H.  defined  bv 

A  1  3 


21 


=  {u  €  H1  :  2u£|Cj|2|\j  =  ||u||2  <  a  <  »}  , 


K  >  2  . 


Then  for  u  6  H.  we  have 
4 


u|2  Z(u,S  ,X)2  5  2n  |\!2(n+1)K  £  |c.|2[X|  2lCj  5  |  X  | 2  (n+1)K  ||u||2 


'X2  '  ’  n 


and  so 


|u-unlx<CM  |xr2n-2|x|(n+1>"  <  Cllull  |x|<n+l)(-2-Kc)  . 


Hence  we  see  that  the  method  converges  for  u  €  ,  since  <  >  2  ,  instiite 

of  the  fact  that  the  solution  is  very  imperfect.  (Nevertheless  we  can 

expect  computational  difficulties  caused  by  round  off  errors.)  Using  the 

discretization  parameter  a(S  ,V  )  =  n  ^  we  see  that  we  have,  in  the  above 

n  n 

mentioned  case,  the  rate  of  convergence  r  =  (X|^n+^^  24k)  ^  e  ^  exponentional 

rate  of  convergence)  which  of  course  is  lower  that  the  error  achievable 

by  S  and  characterized  by  Z(u,S  ,X)  . 
n  n 

c)  Let  us  now  consider  the  trial  space  as  before  but  change  V 

A 

to  V  defined  by 
n 


V  =  V  ={v:v=  y  c.e1'JX  ,  c.  complex}  . 
n  j=-n+l  J  J 


Now  the  c (n)  are  determined  by  (2.2.8)  with  j  =  -n+l,...,n+l  .  Repeating 

now  the  analysis  we  easily  see  that  for  | X j  <  1  ,  the  method  is  quasi- 

optimal  on  L2(I)  but  it  faces  the  difficulties  for  |x|  >  1  which  are 

analogous  to  those  faced  in  the  case  I X I  <  1  when  (S  ,V  )  was  used. 

n  n 

Let  us  now  summarize  some  main  points  we  have  seen  in  this  example. 

1)  If  we  select  S^,V^  for  the  test  and  trial  space,  then  in  the  case 

| X |  >  1  all  solutions  u  €  L2(I)  are  perfect  and  the  method  is 
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quasi-optimal .  In  the  case  |X|  <  1  there  are  solutions  which  are  very 
imperfect  but  there  exists  a  large  class  of  solutions  which  are 
perfect . 

2)  Although  a  solution  can  be  very  imperfect,  the  method  still  can 
converge. 

3)  The  optimality  constant  C(u,Mn,X)  could  deteriorate  exponentially 
with  n  and  numerical  experiments  will  most  likely  indicate  this  very 
quickly,  because  it  is  unlikely  that  only  perfect  solutions  will  be  used 
in  the  experiments. 

4)  Changing  the  test  space  from  to  significantly  changed  the 

performance  of  the  method.  The  case  |\|  >1  has  a  symmetric  "major" 

part  while  the  case  |\|  <  1  has  a  nonsymmetric  "main"  part.  If  the 
symmetric  part  is  the  "major"  one,  then  usually  (as  in  this  case)  it 
is  desirable  to  select  the  same  trial  and  test  spaces,  while  if 

the  nonsymmetric  part  in  the  main  one  (|k|  <  1),  then  different  trial 
and  test  space  are  usually  recommended. 

Example  2.2.2  Consider  the  problem  and  the  method  introduced  in 
Example  2.1.2  and  2.1.2*. 

°1  1 

a)  Consider  first  the  choice  H  =  =  H  ,  X  =  X  =  H 

For  any  u  €  we  easily  see  that  the  equations  defining  u^  =  u(M  )  ,  namely, 

r 

u  €  S 
n  n 

< 

B(u  v)  =  B(u,v)  ,  for  v  €  V  , 
n  n 

v 

are  equivalent  to 


u  €  S 
n  n 


1 

u'v'dx  =  I  u'v'dx  ,  for  v  f  V  , 


-1 


-1 
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i*e*  ’  •  is  th«  orthogonal^proj action  of  u  into  Sn  with  respect  to 

the  inne-  '  Product  <u,v)  -  J  u'v'd*  on  ^  .  It  is  lmiediate  fr„. 


this  that 


C(H  ,S  X  )  5  V2  . 
i  n  1 


Thus  all  u  6  are  K-perfect  with  K  =  V2 

h,  Now  we  turn  to  the  second  choice  of  space  considered  in  Example  2.1.2 
and  2.1.--'*,  and  namely  H  =  =  Ljll)  and  X  -  X2  -  L  (I) 

For  any  u  €  ,  we  can  write 


(2.2.13) 


u  =  I  b.p. 


where  th&  P±  are  the  Legendre  polynomials  and 


(2.2.14) 


h  -  21+1 

bi - n  up  dx 

1 


We  note  t?aat 


-  l  yrrr 
X2  i=0  21+1 


u  can  al  uo  be  formally  written  as 


(2.2.15) 


u  =  I  d;(P- 


where  o>1  *  P1+1  - 
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(2.2.16) 


di=-bo 


d2=  -bx 


.  dl-l  dl+l~  bi  *  1  2,3> 


We  will  derive  the  formula  for  C  (u,S  ,X«)  in  the  special  case  when 

n  l 

u(x)  is  even  with  respect  to  0  .  In  this  case  =  0  for  i  odd 

and  d^  =  0  for  i  even.  At  first  we  assume  u  is  a  polynomial  which 
satisfies  the  boundary  conditions  u(+l)  =  0  .  Then  both  series  expansions 
(2.2.13)  and  (2.2.15)  terminate  after  a  finite  number  of  terms. 

Clearly 


z  (u,S  ,X  )  =■ 
n  2 


where  s  is  the  projection  of  u  into  in  the  space  ,  i.e.,  where 

A 

s  is  characterized  by 


s  €  S 


sv  dx  = 


uv  dx 


for  v  €  S 


From  the  basic  properties  of  Legendre  polynomials  we  see  that  degree  of 
tp  is  i  +  1  and  cp^(+l)  =  0  .  Thus  cp^ , . .  .  and  moreover  it 

is  easy  to  see  that  , .  .  .  form  a  basis  for  .  We  write  s  in 


the  form 


n-1 

s  =  l  (di+zi)(pi 


attempt  to  find  the  z^  .  Since  u  is  even  we  have  d_^  =  z^  =  0 
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Now 
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I  p|  - 


fl  V  2 

l  zi(pi+rpi-i>l  dx 


-i  i=i 


n-l  n-1  rl 

Ji  4  zs  J.j  (pj+i'pj-i)(pi+i'pi-i)dx 


Using  the  defining  equations  for  zi» • • • »z2k-l  we  thus  have 


(2.2.21)  |p| 


d„.  . .  z. 


2d2k+ik(2k-1) 


X2  4k+l  2k+l  2k  1  (4k+l)(2k2+3k+l) 


Writing  a  in  the  form 


0  =  ji2k  d3(Pj+l  P3-l}  d2k+lP2k  +  b2k+2jP2k+2j 


we  see  that 


(2.2.22) 


— - —  d2  +  2  T  — —1A1 
4k+l  °2k+l  ^  4k+4 j+1 


2k±2i 


Finally  we  see  that 


(2.2.23) 


-1 


90  dx  “  4k+l  Z2k-ld2k+l  lpI 


Thus,  combining  (2.2.20) 
(2.2.4) 

Z2(u,Sn,X2)|u|22  =  |u-s|2 


(2.2.21),  (2.2.22),  and  (2.2.23)  we  get 


2  M2k-1)  2  - 

(4k+l)(2k2+3k+l)  4k+1  2kH  j=l 

2d2  co  b2 

2k+l  2  y  P2k+2  j 

2k2+3k+l  j=l  4k+Ai+1 


b2k+2j 
4k+4 j+1 
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Next  we  calculate  ju-u  |  .  In  terms  of  the  d 

n  *2  l 

can  ;:ive  a  simple  formula  for  ,  namely 


and  <p  we 


(2 . 2..25) 


n-1 

Un  "  I,  di  ' 


i=l 


To  seis  this  observe  that 


u*  =  y  d.  «p: 

i-i  1  1 


=  l  d  (2i+l)p 
i=l  1  1 


n-1 

u '  =  I  d  (2i+l)p 
i=l 


and  hence 


'-1 


(u'-u')<p!dx 
n  3 


=  -  J  (u-un)<p^'dx  =  0  ,  j  =  1, . . .  ,n-l 


From  (2.2.15),  (2.2.22),  and  (2.2.25)  we  get 


oo 

i  r 

,2  i 

,2 

!  1 

di  ^i ' X„  =  lG' 

lx. 

i=n 

2 

h2 

2 

d2  +  2  y 

b2k+2i 

4k+l 

d2k+l  2  .4 

3  =  1 

4k+4 j+1 

Finally  combining  (2.2.24)  and  (2.2.26)  we  have 
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u-u 


n'X, 


(2.2.27) 


C  (u, S  ,X  )  = 
n  2 


I  -  2 

lu-s'x. 


oo  h 

2  2  ,  o  r  P2k+2j 

4k+l  2k+l  ^  4k+4  j  +1 


0  A2  v  2 

2  n  oo  n 

a2k+l  .  r  °2k+2j 
2kWl  j=l  4k+4j+1 


j2 

d2k-H 

4k+l 


+  A 

2K4-1 


.2 

d2k+l 

2k2+3k+l 


+  A 


2k+l 


where 


co  b2 

A  _  y  2k+2j 
2k+l  4k+4j+l 

and  n  =  2k  .  We  have  established  (2.2.27)  for  any  even  polynomial  satisfying 
the  boundary  conditions.  A  simple  limiting  argument  establishes  it  for  any 
even  u  €  L^I)  • 

It  is  immediate  from  (2.2.27)  that 


(2.2.28) 


CjVft  5  C(X2,Mn,X2)  <  C  VG 


and  so 


C(X  M,X  )  =  +  »  . 
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From  (2.2.27)  we  also  see  that  if  u  is  such  that 


-f±i<KA2kfl  .  for  all  k. 


then  u  is  vK+l  -  perfect.  For  the  particular  choice  u  =  u  =  1  ,  we  get 


b.  =  0  ,  i  >  1  and  we  see  that 
3 


(2.2.29)  C(u-t,n,X2) 


By  compairing  (2.2.28)  and  (2.2.29)  we  see  that  u  belongs  to  the  set  of 
most  imperfect  solutions.  It  is  easy  to  see  that 


Z(u-W  lu|X2  5  n 


and  hence 


5  C  n~l/2 


|U-Un"x2  -  °  n 


Thus  the  method  converges  for  u  although  C(u,S  ,X^)  +  “  . 

oo 

If  we  define  u  by  u  =  £  b.p.  where  b.  =  i  ,  with  0  <  e  <  1/2  , 

i=0  1  1  1 

then  we  see  that 

2  2  ^+1 
I  1 2  ~>  Q±1 

'""Un'X2'-  2n+l 

2(bn+b„+- • -+b  )2 
u  i  n 


(1+2  C+4  C+...+n_E)2 


(?  -~e): 


>  2  - -- 

2n+l 

2-2c 

=  n _ « 

2 (2n+l )  / 
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This  illustrates  the  fact  that  the  method  could  diverge  even  though 
Z(u,Sn>X)  0  .  Although  the  method  is  not  robust,  we  see  that  the  optimality 
constant  grows  relatively  slowly  with  n  (C  ~  \^i)  . 

Let  us  now  summarize  some  main  points  we  have  seen  on  this  example. 

°1  1 

1)  The  method  =  (H  , H  ,F,B,a)  is  quasi-optimal  but  the  method  j 

°1 

=  (H  .L^.F.B.a)  is  not  quasi-optimal,  although  computationally  the  methods 
are  identical. 

2)  The  set  of  perfect  solutions  for  the  method  M ^  is  relatively 
large  and  very  likely  the  method  would  work  well.  The  performance,  in  fact,  uoul 
be  good  unless  the  solution  under  consideration  were  one  of  the  relatively 
rare  imperfect  solutions.  The  performance  can  be  good  also  if  the  solution 
is  imperfect  since  for  reasonably  smooth  solutions  u  satisfying  the  boundary 
conditions,  ZCu.S^.L^)  -*  0  more  rapidly  than  (VrT)  -*■  00  .  (Recall  that 

Vn  gives  the  growth  of  the  optimality  constant.) 

2.3.  Stability  and  the  Stability  Condition 

We  have  seen  that  the  optimality  constants  C(u,M,X),  C(H,M,X),  and  C(H,M,X) 
play  essential  roles  in  understanding  variational  methods.  The  notion 
of  K-perfect  solutions  is,  in  fact,  defined  in  terms  of  C(u,M,X)  .  We  have 
also  seen  in  Examples  2.2.1  and  2.2.2  that  the  calculation  or 
estimation  of  optimality  constants  can  be  rather  subtle.  It  is  important 
to  be  able  to  estimate  these  constants  for  wide  classes  of  problems.  Toward 
this  end  we  introduce  the  notion  of  the  stability  constant  which  is  often 
easier  to  estimate  and  in  terms  of  which  we  can  estimate  the  optimality 
constant s . 

Let  M  =  (H,S,V,B)  be  a  simple  variational  method  and  let  H  c  X  and 
H  c  H  .  For  u  (  H  define 

| P(u+s) | 

D (u ,M, X)  =  sup  - - — 

sfS  I u+s I 


(2.3.1) 
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(2.3.2) 


D(H,M,X)  =  sup  D(u,M,X) 
u€H 


D(u,M,X)  is  called  the  stability  constant  of  M  on  u  with  respect  to 
X  and  D(H,M,X)  is  called  the  stability  constant  of  M  on  H  with 
respect  to  X  .  For  a  directed  variational  method  M  we  define 
D(H,M,X)  =  sup  D(H,M,X)  .  D(H,M,X)  is  called  the  stability  constant  of  M  on 

mcM 

H _ with  respect  to  X  .  We  say  M  is  stable  on  H  with  respect  to  X  if 

D(H,M,X)  is  finite.  We  now  show  the  relation  between  the  stability  and  optimalit 
constants . 

Theorem  2.1  The  stability  and  optimality  constant  satisfy 


(2.3.3) 


D(u,M, X)-l  5  C(u,MX)  5  D(u,M,X)+l  , 


(2.3.4) 


D(H,M,X)-1  <  C(H,M,X)  5  D(H,M,X)+1 


Proof.  We  easily  see  that 


|u-Pu|x  =  | (u-s)  -  P(u-s) 


<  lu_slx  +  |P(u-s) Ix 


{ 14D(u,M,X) 1 j  u-s I 


for  any  s  f  S  ,  which  proves  the  second  inequality  in  (2.3.3).  For  any 
s  €  S  we  have 
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|p(u+s)|x  | (u+s)  -  P(u+s)|x  +  lu+slx 

|u+s|x  lu+slx 

u~Pu| 

5  1+  - - 

|u+s|x 

u-pul 

5  1+ - - 

infju-s j 
s€S 

=  1  +  C(u ,M,X)  , 

which  proves  the  first  inequality  in  (2.3.3).  (2.3.4)  follows  immediately 

from  (2.3.3). 

We  see  that  a  directed  variational  method  is  quasi-optimal  if  and 
only  if  it  is  stable. 

Let  us  now  turn  to  a  further  discussion  of  stability.  Suppose  H  is 
closed  in  X  .  We  easily  see  that  D(H,M,X)  =  ||p||  ,  the  operator  norm  of 

P  .  In  Section  2.1  we  introduced  the  idea  of  a  computational 
variational  method  as  a  sequence  of  simple  variational  methods  = 

(H, S^.V^.B)  selected  from  a  directed  variational  method  M  =  ,F  ,B,a) 

Assume  we  are  given  such  a  sequence  and  suppose 

(2.3.5)  D(H,M  ,X)  =  ||P(Mn)||  ->  -  as  n  ->•  -  . 

We  will  now  show  that  this  implies  that  there  is  a  z  €  H  such  that  the 
method  does  not  converge  to  z  . 

Theorem  2,2  Suppose  (2.3.5)  holds  and  suppose  H  is  closed  in  X  . 

Then  there  is  a  z  €  H  such  that  P(M  )z  -*■  z  . 

n 

Proof.  Suppose  the  method  converges  for  all  u  €  H  ,  i.e., 

|P(M  )u  -*•  u|„  -*■  0  as  n  -*  00  for  all  u  €  H  .  This  implies  P(M  )u  is 
n  X  n 

bounded  for  each  u  .  The  uniform  boundedness  principle  [6,  p.  190]  then 
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1 

* 

i:  ' 


« 


« 


implies  that  jj P (Mn ) II  is  bounded  in  n  .  This  would  contradict  the 
hypothesis  (2.3.5). 

Theorems  2.1  and  2.2  show  that  methods  which  are  not  quasi-optimal  are  not 
robust  in  the  sense  that  there  exists  solutions  for  which  the  methods  do 
not  converge.  It  should  be  noted,  however,  that  the  solution  z  in 
Theorem  2.2  could  be  very  "wild"  or  "irregular"  and  the  computational 
variational  method  might  still  converge  for  a  large  class  of  u 
in  H  . 

Let  us  note  a  further  significance  of  D  .  Suppose  u  €  H  and  s  €  S 
satisfies  Z(u,S,X)  =  |u-s|^  .  Set  £  =  Pu-s  .  We  easily  see  that 

felx  =  ,Pu_s,x 
=  |p(u-s)|x 

5  D(u,M,X)|u-s|x  . 


Thus  D  is  a  magnification  factor  relating  |^[Y  and  |u-s|v  .  Usually 
we  replace  D(u,M,X)  by  D(H,M,X).  It  should  be  noted  that  this  replace¬ 
ment  could  lead  to  a  rather  pessimistic  estimate. 

We  noted  earlier  that  the  stability  constants  are  often  easier  to 
estimate  than  the  optimality  constants.  We  will  now  show  how  the  stability 
constantscan  be  estimated  in  terms  of  the  bilinear  form  B. 

Theorem  2.3.  Suppose  the  space  V  is  furnished  with  a  norm  |v 
Assume 

1)  for  any  u  €  H  there  is  a  constant  C(u)  such  that 


'V 


(2.3.6)  |B(u+s,v)  i  C(u) [u+s Jx |v ,  for  all  s  €  S  and  v  €  V 


and 


(2.3.7) 


2) 


inf 

s€S 

|s|x=1 


sup  | B (s , v)  |  =  y(s  ,V)  >  0 
v€V 


Hv=1 


Then 


(2.3.8) 


D(u,M,X) 


s  1 C(u) 1 
Y  (S  ,V  ) 


for  all  u  €  H  . 

Proof.  From  (2.3.6)  and  (2.3.7)  we  have 


y(S,V)  |P(u+s)  |  <  sup  | B (P (u+s) ,v) I 

X  v€V 

Mv-i 


=  sup  |B(u+s,v)| 
v€V 

lv  lv=l 

£  C(u)|u+s|x 


for  all  s  €  S  ,  from  which  (2.3.8)  follows. 

Remark:  If  (2.3.6)  holds  for  all  u  C  H  with  C(u)  replaced  by  C(H)  , 
then  (2.3.8)  holds  for  all  u  €  H  with  C(u)  replaced  by 
C(H)  ,  i.e., 


D(H,M,X) 


<=  C(H) 

"  y(s ,v) 


We  can  also  estimate  D(H,M,X)  below  by  l/y(S  ,V  )  provided  certain 
additional  assumptions  are  made.  Suppose  H  and  V  , furnished  with  the 
norms  I 'lx  anc*  I  *  I  (/  *  respect  ively  ,  are  a  pair  of  Hilbert  spaces  or 
reflexive  Banach  spaces,  B  is  a  bilinear  form  on  H  x  1/  ,  and  S  c  H  , 


V  c  1/  are  a  pair  of  finite  dimensional  subspaces.  For  v  €  V  we  set 
Ivl..  =  I  v I . .  .  Now  assume 


E 

* 

■» 

\ 

i 

i 


i 

■«* 


(2.3.9) 

| B(u,v) | 

_ X 

3 

U 

VI 

for  u€H,v€l/, 

(2.3.10) 

inf 
u  (H 

Mx= 

sup 

v€l/ 

i  Mv-i 

|  B(u, v) | 

hi 

e 

V 

o 

and 

(2.3.11) 

sup 

u€H 

| B(u, v) j 

o 

A 

for  0  v  (  V  . 

We  note  that  in  the  presence  of  (2.3.9),  (2.3.10)  and  (2.3.11)  are  necessary 
and  sufficient  for  the  variational  problem 

u  €  H 

B(u,v)  =  F(v)  ,  for  v  €  1/ 

to  be  uniquely  solvable  for  each  bounded  linear  functional  F  on  \J  . 

In  this  situation  we  also  have 

u|  S  co  1  sup  l  =  co_1  ||  F || 

X  v€V  lvl 

This  result  is  proved  in  [],  p.112] 

Theorem  2.4  Suppose  (2 . 3. 9)- (2 . 3 . 11 )  hold.  Let  u  €  S 
and  set 


(2.3.12) 


sup 

vfl/ 


|  B  (  u ,  v  ) 


s  Q(u)  . 


Then  there  exists  a  u  €  H  such  that  P(M)u  =  u  and 


'  Qfu)  ‘ 


(2.3. 13) 


D(u,M,X) 


Proof.  Obviously  F(v)  =  B(u,v),  v  €  V  ,  defines  a  bounded  linear  functiona 
V  .  It  is  possible  to  show  F(v)  can  be  extended  to  a  bounded  linear 
functional  on  U  with 


sun  |F(v) | 
v€l / 

MlT1 


sup  | F (v) 

Mr1 


=  Q(u) | u | 


This  is  the  content  of  the  Hahn-Banach  Theorem  [6,  p.  134]. 

As  noted  above,  from  (2 . 3. 9)-(2 . 3. 11)  it  follows  there  is  a  u  €  H 
satisfying 


B(u,v)  =  F(v)  ,  for  all  v  €  [I  , 


Q(u) | u | 


(2.3.14) 


It  follows  immediately  from  the  definition  of  F  that  P(M)u  =  u 
Using  (2.3.14) and  the  definition  of  D(u,M,X)  we  thus  have 


I Pu  lx  I u  !X  co 

D(u,M,X)  >  j  j  =  -j —A  ?  . 

u  u  Q(u) 


This  completes  the  proof. 

Theorem  2.3.  Suppose  (2.3.7),  (2.3.9)  (2.3.11)  hold.  Then 


(2.3.15) 


D(H,M,X) 


Y(S  ,V) 


Proof.  This  result  follows  Immediately  from  Theorem  2.4  and  the 
definitions  of  D(H,M,X)  and  y(S,V)  . 
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Let  us  now  elaborate  on  Theorem  2.4.  It  shows  that  functions  u 
with  D(u,M,X)  large  should  be  sought  among  functions  whose  projection 
u  =  P(M)u  have  small  Q(u)  .  Of  course  not  every  u  €  H  satisfying 
P(M)u  =  u  leads  to  large  D(u,M,X)  ,  as  shown  by  the  fact  that  for  u  €  S  , 
Pu  =  u  and  D(u,M,X)  =  1  .  If  we  define 

(2.3.16)  R(u)  =  {w<EH:P(M)w=u> 

then  for  u  €  R(u)  ,  D(u,M,X)  '.11  be  largest  for  that  u  with  smallest 

|u|v  .  Thus  if  we  define 
A 

W(u)  =  inf  lwlv 
wCR(u) 


we  see  immediately  that  there  is  a  w  €  H  such  that 


D (w ,M, X) 


W(u) 


Obviously  every  choice  of  u  €  S  and  u  i  R(u)  lends  to  a  lower  bound 
for  D(H,M,V) : 


D(K,M,V)  > 


This  gives  a  practical  tool  for  estimating  D(H,M,X)  from  below. 

Condition  (2.3.7)  is  easily  seen  to  imply  the  regularity  of  the 
bilinear  form  B  on  S  ‘  V  . 

A  directed  variational  method  M  =  (h  ,  X ,  t* ,  F ,  B ,  u )  is  said  to  satisfy 
the  stability  condition  if 


(2.3.17) 


y(S,V)  •  y  >  0 

I 


for  all  (S,V)  €  F  . 


Tills  condition  is  sometimes  also  called  the  inf-sup 
condition,  or  the  LBB  or  RB  condition.'*'  Theorems  2.1  and  2.3  show  that 
when  (2.3.17)  holds,  then  D(H,M,X)  is  finite  and  the  method  is  stable. 

An  especially  important  situation  occurs  when  H  =  1/  ,  F  is  the 
family  of  pairs  (S,V)  with  S  =  V  and 

(2.3.18)  B(u,u)  >  y|u|£  ,  Y  >  0 

for  any  u  (.  S  ,  (S,S)  €  F  .  Then  y(S,S)  >  y  .  In  this  case  we  say  that 
the  form  is  coercive. 

Suppose  M  =  M(S  ,V  ) , (S  , V  )  €  F  ,  is  a  computational  variational 
n  n  n  n  n 

method  for  which  y(S  ,V  )  -*■  0  as  n  -*•  "  .  In  addition  suppose  M 

n  n  n 

satisfies  the  hypotheses  of  Theorem  2.4  for  each  n  .  Then  Theorem  2.4  shows 
that  there  is  a  sequence  u^  €  H  such  that 

D (u  ,M  ,X)  -*•  »  . 
n  n 

Theorem  2.4  does  not  show  the  existence  of  a  u  6  H  such  that  u(M  )  /  u  . 

n 

This,  however,  follows  from  Theorem  2.2. 

Combining  this  observation  with  Theorem  2 . 3  we  see  that  the  stability 
condition  (2.3.17)  is  necessary  and  sufficient  for  quasi-optimality  of  the 
directed  variational  method  A)  =  (H  ,  X,  l/,  F ,  B ,  a)  .  The  condition  is  necessary 
in  the  sense  that  if  it  falls  then  there  will  exist  at  least  one  u  €  H 
for  which  the  method  fails.  As  we  have  noted  above,  however,  the  method 
may  still  perform  well  for  all  solutions  of  major  practical  interest. 


This  terminiology  relates  to  [1],[5]  where  this  condition  was  first  introduced 
in  connection  with  the  analysis  of  the  finite  element  methods. 


Various  statements  in  the  literature  that  certain  mixed  methods  work  well 
inspite  of  the  fact  that  the  LBB  (BB)  condition  is  violated  have  to  be  understood 
in  light  of  this  observation. 

Let  us  return  now  to  the  problems  and  methods  introduced  in  our  previous 
examples  and  apply  the  results  discussed  in  this  section. 

Example  2.3.1  (Cf.  Examples  2.1.1,  2.1.1*,  and  2.2.1). 

Suppose  A(x)  in  (2.1.3)  is  of  the  form 


A(x)  =  Aq  +  B (x) 


where  A^  is  a  constant  invertible  matrix  satisfying 


II Aq  B(x)  ||  5  q  <  1 


where  (|*||  denotes  the  matrix  norm  associated  with  the  Euclidean  vector 
norm.  Take  X  =  H  =  (L^CI))^  and  let  1/  =  (L^d))^  as  before 
It  is  immediate  that 


B(u,v)  5  C|u|  | v| y  ,  for  u  £  H  ,  v  €  V 


with  C  =  2  max  ||a.  .  (x)||  ,  .  .  Given  u  €  H  ,  let  v  =  (A  )u  .  We 

.  ,  11  L  ^i/  U 

easily  see  that 


B(u,v)  =  uA  (A_+B)u  dx 

J-TT 


r  1! 

=  u(l+A  ^B)!!  dx 

J  -ii  u 


I u I v (i-q) 


and 
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iviy  2  Q|u|x 

where  Q  =  ||(Aq^)^||  ,  from  which  we  obtain  (2.3.10)  with 

to  =  >  0  ■ 

Thus  conditions  (2.3.9)  and  (2.3.10)  hold  for  this  example.  Condition 
(2.3.11)  is  also  easily  seen  to  hold.  (Note  the  assumption  that  A^  is 
a  constant  matrix  was  not  used.) 

Our  main  goal  is  to  show  that  (2.3.7)  holds.  In  the  previous  paragraph 

we  have  proved  the  infinite  dimensional  analogue.  The  proof  of  (2.3.7)  is 

similar  but  we  use  now  the  fact  that  Aq  is  constant  matrix.  For  _u  €  S  , 

-1  T 

let  v  =  (Aq  )  u  .  Then  v  €  V  since  Aq  is  a  constant  matrix.  The 
above  estimate  thus  shows  that 

Y( S,V)  >  ^  >  0  . 

Therefore  we  see  that  the  assumptions  in  Theorem  2.3  are  satisfied  and, 
moreover,  the  stability  condition  (2.3.17)  is  satisfied  with  y(S,V)  ^/Q  . 
Although  the  quasi-optimality  of  the  method  under  our  assumptions  could  be 
proven  by  the  approach  and  in  Section  2.2,  obviously  the  approach  used  here 
is  much  simpler  and  can  be  used  in  many  situations. 

Example  2.3.2  (Cf.  Examples  2.1.2,  2.1.2*,  and  2.2.2.) 

°1 

Consider  the  problem  introduced  in  Example  2.1.2.  Let  H  =  =  H  , 

1  °  1  1 

X  =  =  H  ,  and  V  =  H  with  the  II  -norm  as  above  in  case1,  a.  We  obviously 

have 

B(u,u)  >  2  1/2|u|J  ,  for  u  €  , 

i.e.,  B  is  coercive,  and  we  thus  see  that  the  assumptions  of  Theorem  2.3 
are  satisfied  and  hence  the  quasi-optimality  of  this  method  has  again  been 
established.  As  we  have  seen  in  Example  2.2.2  a  similar  analysis  would  fail 
in  case  b) • 


introduced  in 


Example  2.3.3  (Cf.  Examples  2.1.3  and  2.1.3*) 

Let  H  =  ft1  ,  X  =  H1  ,  and  V  =  L2  . 

a)  First  let  us  consider  the  space  and  V, 

case  b)  in  Example  2.1.3.  Then  (2.3.6)  and  (2.3.7)  are  satisfied  and  as 
before  we  conclude  that  the  method  is  quasi-optimal  and  stable. 

b)  Now  consider  the  case  of  test  spaces  V  introduced  in  case  a). 

.L,n 

Our  goal  is  to  show  that 

(2.3.19)  Cjn  <  D(tf,Mn,X)  <  n 


where  C,  ,  C0  >  0  and  independent  of  n  and  with  M  =  M(S  ,V,  )  . 

J-  ^  n  n  1  ,n 

We  will  prove  the  left  side  of  (2.3.19)  by  applying  Theorem  2. A. 

Toward  this  end  we  construct  a  particular  u  and  estimate  Q(u  )  ,  as 

n  n 

defined  in  (2.3.12),  from  above.  Let  u  €  S  be  defined  by 

n  n  1 


(2.3.20)  u’(x)  =  (-1) 

n 


J  3-1/2 


X.  ,  <  X  <  X, 

3-1  j 


j  =  1,2, ... ,n  ; 


the  graph  of  u'  is  shown  in  Fig.  1. 


Fig.  1  Graph  of  the  function  u  . 


Note  that 


(2.3.21) 


Vx  iC>0 


Now  let  \J/_ , .  . .  ,\lr  ,  €  V,  denote  the  basis  functions  defined  by 
T0  Tn-1  l,n  } 


fn(x)  =  1-nx  ,  0  <  x  <  x1 


^(x)  = 


nx-i+1  ,  x.  ,  <  x  <  x. 

l-l  i 


-nx-t-i+1  ,  x^  <  x  <  x^+^  ,  i  =  l,...,n-l 


Then  we  have 


(2.3.22)  B  (u  ,ijr . )  = 
n  l 


(  n1"1  ~2 

(-1)  ti 


j  l  —  lj...|n**l 


1  -2  ,  n 
-  T  n  »  1  =  0. 


Any  ^  €  V_  can  be  written  in  the  form 
l,n 

n-1 

*  =  l  Cit± 


and  we  easily  see  that 


If1  2  2  2  1  _  ,  1 2  1  r  1  2  2  2  1 

3^LrO+cl+"-k„-lJ'  l*lV  =  n  L  2t;0+cl+'”+cn-lJ 


Therefore 


|B(u  ,i|r)|  *=  I  l  <  iB(un  .'Ml 

i=0  n 


f n-l  c  .  (-D; 


(2.3.23)  Q(u  )  2  n  S  Cn 

n 

Since  (2.3.10)  holds  with  co  =  —  ,  we  see  from  Theorem  2.4  that  D  >  Cn 

3/2 

This  proves  the  left  side  of  (2.3.19). 

Before  turning  to  the  proof  of  the  right  side  we  will  construct  a 

particular  w  €  R(u  )  which  vrill  play  the  role  of  u  in  Theorem  2.4. 

n 

We  begin  by  constructing  £_(x),...,£  ,  (x)  so  that 

u  n~T 


£^(x)  =  0  for  Ix-x^J  >  1/n 


rx 


(2.3.24)  J 


i+1 


i-1 


ei+j  d* 


_  wj 


i-1 


ij  ’ 


i,j  =  0 . n-1  ,  not  both  0 


V 


Vo  dx  = 


zl 

2n 


where  6^ 
in  Fig.  2. 


f0  ,  i  ^  j 
,  i  =  j 


Specifically  we  choose  £^(-1)*  ^  as  shown 


Fig.  2.  The  graph  of  the  function  . 


Then  we  define 


i 

(2.3.25) 


P(x) 


_1_ 

2n 


n-1 

I 


i=0 


Ci(x) 


and 


(2.3.26) 


w(x)  =  [  p(t )dt  . 
J0 


It  is  easy  to  see  that 


(2.3.27) 


From  (2.3.22),  (2.3.24)  and  the  definition  of  w  we  have 

1  _  f1 

B(u  ,\K)  =  unt,-  dx  =  w’t.  dx  =  B(w  ,  )  ,  i  =  0,...,n-l  , 

1  0  J0 


which  shows  that  P  w  =  u  .  Using  (2.3.20)  and  (2.3.26)  we  thus  have 

n  n 


lpn(w)l; 


i  C  n 


w 


which  was  our  goal . 

Let  us  remark  that  un  is  not  unique.  It  is  essential,  however,  that 

it  is  what  we  referred  to  earlier  as  a  wild  or  irregular  function.  It  is 

also  of  interest  to  note  that  the  function  u  ?  S  defined  by 

n  n 


u’(X)  =  (-l)j 
n 


xj_l  <  x  <  xj  » 


will  not  lead  to  the  desired  result  since  we  could  only  prove 
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We  will  now  prove  the  right  side  of  (2.3.19)  by  applying  Theorem  2.3. 
Le*  '  u  €  Sn  and  set 


w^  =  u'(x^-l/2n)  ,  i  =  1,2,. ..,n  . 


Theen  select  v  €  V.  so  that 

l,n 


V(x.) 


wheere 


Vj  =  w±-Hri+1  *  1  =  1 . n-1 


V0  =  W1  ’  vn  =  °  * 


We  easily  see  that 


B(ii,v)  =  2^-  (w1 , . . .  ,w  ) 


/  2  1 
12  1 


\ 


12  1 
1  1 


/  “A 


\;l 


Consider  the  auxiliary  (2n-l)  *  (2n-l)  matrix 


2  1 
1  2  1 


\ 

and  the  associated  quadratic  form 


0 

.  1 
•  • 

2  1 

1  2 


The  eigenfunctions  are  sin  k  ,  k  =  l,...,2n-l  ,  and  thus  it  is  easily 
seen  that  the  least  eigenvalue  X^(n)  of  A  satisfies 


X^(n)  >  Cn 


(2.3.26) 


?  2n_1  2 
$(z)  >  C  n'Z  l  z. 

i=l 


Setting 


we  see  that 


B  (  u ,  v )  =  ~  <*(z) 


and  therefore  from  (2.3.26)  we  obtain 


(2.3.27) 


B(u,v)  >  C  n  ^(2w?+* * *+2w^  .+w  ) 
1  n— 1  n 


-  J  6  i 
>  Cn  )  w . 


We  also  see  that 


lv  lj/  —  C  n_1  l  v* 

V  i=0 


(2.3.28)  < 


=  C  n  ^  [w2+(w1+w->)2+*  *  *+(w  ,+w  )  ] 

1  12  n-1  n 


=  C  B(u,v)  . 


Combining  (2.3.27)  and  (2.3.28)  we  obtain 


lifeviL  >C|B(U,V)|1/2 


>C  n  1 1  u  | , 


from  which  we  get 


Y(S  ,V  )  = 
n  1  ,n 


inf  sup 

u€S  v^V, 
n  1  ,n 

lUlX=1  ' 


| B(u,v) |  2  C  n 


-1 


lVll/=1 


Since 

|  B(u, v) |  S  |u|x  |V 1^ 

for  u  €  H  ,  v  £  V  ,  thus  from  Theorem  2 . 3  we  have 

D(u,M^,X)  1  C  n 

for  u  i  H  .  This  proves  the  right  side  of  (2.3.19). 

Next  we  show  that,  although  D(H,M  ,X)  is  large,  the  method 

n 

still  works  well  for  smooth  solutions.  Specifically  we  will  prove  that 


,3 


for  u  €  C  (I)  ,  where  u  =  P(S  .V,  )u  and  h  =  1/n  .  Let  u.  =  u  (x.) 

n  n  i,n  i  n  1 

It  is  easy  to  see  that  the  finite  element  equations  reduce  to 


u0  =  ° 


u„-u 


(2.3.30)  < 


1  0 


f  dx 


ui+ru- 


i-l  f  i+l  , . 

-  ft. 

Y 


dx  ,  i  =  1 , .  .  . ,n-l  , 


i-l 


where  ,  are  the  basis  functions  for  V,  introduced  earli 

f 0  Tn-1  l,n 


xer 


3  —  2  — 

Since  u  €  C  (I)  ,  f  €  C  (I)  and  we  can  write 


(2.3.31) 


X1 

fXl 

f  tn  dx  - 

f  dx 

X 

o 

( 

xo 

Xi+1  ,  ,  , 

’Xi+1 

f  f-  dx 
Ti 

Xi-1 

Xi-1 

f  dx  =  a.  h" 
l 


i  =  1 , . . . ,n-l  , 


where 


(2.3.32) 


|a  |  <  C|f| 

(T(I) 


Now,  using  (2.3.30), 


[ui+ru(xi+i)1  ~  [ui-ru(xi-i)]  =  2 


i+l 


rx 


ft-  dx  - 

T  l 


i+l 


f  dx  ,  i  =  1  , 


'i-l 


'i-l 


and  so,  setting  =  u.-u(x.)  ,  (2.3.30)  implies 
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(2.3.33) 


Ki+l~h-l  '  Qih  *  1  "  1**",n"1  * 


^0  =  °  * 


From  (2.3.32)  and  (2.3.33)  we  obtain 


|g. |  £  C ) f |  ?  _  h2  ,  j  even 

2  <T(I) 


kJ  s  cjf|  _h2+  k  |  ,  j 

3  <T(I) 


odd  . 


Using  (2.3.30)  and  (2.3.31)  again  we  have 


T  1  1  2 

F  «  u  -u(x  )  =  2  f  tr,  dx  “  f  dx  -  aQh 

t  -l  -*•  n  u  n 


k-l  -  C|f|  „  _  h  ,  j  =  0,1,... ,n 
3  C  vl) 


with  C  independent  of  n  . 

Letting  T  u  denote  the  S  -interpolant  of  u  we  have 
n  n  n 


(2.3.34) 


l’J-Un!X  S  lu“InulX  +  iTnU-Unfx 


2  Chu  -  _  +  7nu-u  I 

c3(i)  n  n  x 


we  further  see  that 


7  u(x  )-u  (x  )|  =  |r  |  5  Ch  |u| 
n  1  n  i  i  CJ(I) 
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Krom  this  we  get 


'V-“n  V  Ch  MC3(T) 


and  thus,  using  the  fact  that  Js  |  2  CnlsL  for  s  €  S  (the  so-called 

A  n 


inverse  assumptions  for  S^),  we  have 


(2.3.35) 


I  u-u  |  5  Chiu  I  « 

-  "  x  c3d> 


Combining  (2.3.34)  and  (2.3.35)  we  have 


I u_un I v  ~  Chu  . 

n  x  c3(i) 

This  is  the  desired  result. 


Define  now 


H  =  {u  ;  | u |  _  <  «>  ,  u(0)  =  0  ,  u" (x)  >  a.  |u| 

cJ(D  1  CJ(I) 


or  u"(x)  5  -a.  |u|  _  on  (to,r})  with 

cJ(i) 


|  co-T]  I  =  a2>  ,  a1 ,  a2  >  0  . 


Consider  now  the  method  using  (S^V^  ^)  .  Then  we  easily  see  that 


(2.3.36) 


C(H,M  ,H  )  <  K 


where  K  depends  on  a^,a?  but  is  dependent  of  n  .  Tn  fact  it  is 
easy  to  see  that  for  u  €  H  , 


Cha  a  |u|  __ 

7.<u,S  H1)  » - tti-U. 

I"!*1 


and  (2.3.36)  is  obtained  by  combining  this  estimate  with  (2.3.29). 

Let  us  summarize  some  main  points  shown  in  case  b). 

t2-l 

1)  The  method  is  not  quasi-optiraal  on  H  =  H  ;  nevertheless  the 
instability  is  not  too  strong.  The  analysis  has  been  based  on  results 
relating  to  Theorem  2.3. 

2)  The  method  is  quasi-opt imal  on  the  set  H  ,  i.e.,  all  functions 

belonging  to  H  are  K-perfect.  We  note  that  H  is  not  closed  in  . 

3)  The  result  that  H  is  a  set  of  perfect  solutions  cannot  be 
directly  proven  by  Theorem  2. 2-2. 5.  A  special  analysis  is  necessary. 
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3 .  Fur ther  Examples  of  Finite  Element  Methods 
3.1.  Introduction 

In  this  section  we  examine  three  different  finite  element  methods 
for  the  approximate  solution  of  a  simple  model  problem,  namely  that  of  a 
longitudinally  loaded  bar  on  an  elastic  support.  The  classical  displace¬ 
ment  formulation  of  this  problem  is 


(3.1.1) 


I  Lu  =  -(E(x)F(x)u' (x))'  +  b (x)u(x) 


u(0)  =  u ( 1 )  =  0  . 


p(x)  ,  0  <  x  <  1 


Here  u(x)  ,  0  <  x  <  1  ,  denotes  the  longitudinal  displacement  and  E(x) 
denotes  the  modulus  of  elasticity,  F(x)  the  crossectional  area,  b(x)  the 
spring  constant  of  the  elastic  support,  and  p(x)  the  longitudinal  load. 
We  will  let  a(x)  =  E(x)F(x)  and  assume 


(3.1.2)  0  <  px  5  a (x)  5  p2  ,  0  5  b(x)  <  f2 


but  otherwise  allow  a  and  b  to  be  rather  general  functions.  They 
could  be,  for  example,  constant  functions  (corresponding  to  a  bar  with 
uniform  crossection  and  elastic  properties  and  a  uniform  elastic  support) 
or  step  functions  with  many  steps  (arising  in  the  study  of  composite 
materials,  for  example). 

We  now  cast  this  probem  in  variational  form.  Let 


(3.1.3) 


B(u,v) 


r\ 

•'0 


(au ' v '+buv)dx  . 


Then  we  easily  see  that  (3.1.1)  can  be  formulated  as 


(3.1. A) 


J  u  €  Hl 
1  B(u,v) 


=  { n  :  u  €  ,  u(0)  =  u(l)  =  0} 

1 


pv  dx 


for  nil  v  i  H 


°1 


0 
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B  is  defined  in  H  *  V  ,  where  H  =  \J  =  H  . 

To  complete  the  specification  of  a  directed  variational  method,  in  addition 
to  the  bilinear  form  we  must  select  family  F  of  the  trial  and  test  spaces  S 
and  V  ,  the  space  X  ,  and  the  discretization  parameter  a  .  We  will  present 
three  choices  for  S  and  V  in  the  next  two  subsections. 

3.2.  The  Standard  Finite  Element  Method 

Let  A  =  {0  =  xQ  <  x  <  xn(A)  =  1}  ,  n(A)  5  2  ,  be  an  arbitrary 

mesh  on  [0,1]  and  set  I.  =  I. (A)  =  (x  ,x  )  ,  h  =  h  (A)  =  x.-x  , 

3  J  J  3  3  3  J  J 

and  h  =  h(A)  =  max  hj  (A)  •  Then  set 
0£j£n(A) 


(3.2.1) 

s  =  sA  =  v  =  vA 

=  {s(x)  :  s (x) 

is  continuous  on  [0,1] 

s  (x) 

is  linear  on  each  , 

and 

s (0)  =  s(l)  =  0}  . 

Let  H  =  \J  =  H1  ,  F  =  {(SA>VA)  :  A  any  mesh}  ,  X  =  H1  ,  and  =  h(A)  . 

We  can  now  discuss  the  directed  variational  method 

M  =  (H.H1 ,U,F,B,a)  . 

The  optimality  of  this  method  is  given  in  the  following 

ol  1 

Theorem  3.1  The  method  M  is  quasi-opt imal  on  H  with  respect  to  H  ,  i.e. 

(3.2.2)  CdV.M.H1)  <  C 

with  C  depending  only  on  and  P2  but  independent  of  A  .  Thus 

(3.2.3)  |u-P(S  ,V  )u|  *  C  inf  |u-v|  l  . 

H  u€S  H 

A 

Proof.  This  standard  result  is  proved  by  showing  (2.3.18)  and  using 

O  ] 

Theorems  2.3  and  2.1.  In  fact  for  all  u  t  H  we  have 


S(u,u)  =  ((au 1 )  +  bu  )dx 

J0 

f1  2  P 

>  P  (u''Z 
1  J0 


)‘dx  2  ~  |u|2  . 

H 


°1  1 

As  stated  in  Theorem  3.1,  the  optimality  constant  C(H  ,M,H  )  is 
finite  and  is  bounded  uniformly  with  respect  to  the  class  of  coefficients 
a  and  b  satisfying  (3.1.2).  Regarding  approximality  let  us  state  the 
standard  result  for  the  elements  under  consideration: 


(3.2.4) 


inf  ju-s |  £  Ch ju|  -  , 

/*«  »•»•*- 


s€S.  H 

A 
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where  C  is  independent  of  A  .  Thus  for  any  u  €  H  OH  , 


(3.2.5) 


Ch  |  u  I 

Z(u,S  H1)  £  - — 

A  M  i 

H 


If  our  solution  u  lies  in  H  ,  then  from  (3.2.3)  and  (3.2.4)  we  have 
the  error  estimate 


|u-P(S  V  )u|  £  Ch | u | 

H  H 


We  note  that  if  a’(x)  and  b(x)  are  bounded,  then  we  can  prove  that 


M  o  -  c  Ipi 


L2  ’ 


From  this  and  (3.2.5)  we  obtain 


(3.2.6) 


lu-p(VVulu i 5  c  hlplL. 

H  ti 


C  here  depends  on  the  maximum  of  a’  and  b  . 


(3.2.6)  is  not,  however. 


valid  for  problems  with  rough  coefficients  (coefficients  which  are  step 
functions,  for  example). 


Next,  let  us  consider  the  same  family  of  simple  methods  but  choose 
X  =  L2  .  We  present  a  result  showing  the  resulting  method  is  not  quasi- 
optimal. 

Theorem  3.2.  Suppose  a(x)  =  1  and  b(x)  =  0  .  Then  for  any  constant 


C  >  0  and  any 

°1 

A  there  is  a  u  €  H  such  that 

(3.2.7) 

| u-P(S . ,V  )  |.  >  C  inf  |u-s| 

*  2  ses, 

A 

Thus 

°1 

(3.2.8) 

C(H  ,M(Sa,Va),L2)  =  +  » 

for  any  A  . 

Proof.  Suppose  A  =  (0, 1/2,1)  and  let  u  be  as  shown  in  Fig.  3.1 
with  u(l/2)  =  1  .  It  is  easily  shown  that,  since  the 


P(S^,V^)u  is  the  S^-  interpolant  of  u  ,  as  shown  in  Fig.  3  by  the 
dotted  curve,  |u-P(S,,V  )u|  is  nearly  as  large  as  |P(S  ,V  )u|  =•  1  / \/3 


,  which  is 


and  inf  u-s 


5  I u |  ,  is  nearly  as  small  as  zero.  This 

L2 


s€Sa  2 

completes  the  proof  for  this  simple  mesh.  The  proof  for  a  general  mesh 
is  similar. 

It  is  interesting  to  compare  Theorems  3.1  and  3.2  with  Example  2.2.2, 

In  that  example  we  treated  the  same  boundary  value  problems  but  based  our 

approximation  in  polynomials  instead  of  piecewise  linear  functions.  In 

°1 

Example  2.2.2.  we  obtained  C(H  »M^n,Vn)  ,L2>  5  C\/n  in  contrast  to  the 
°1 

result  C(H  ,M(S^,V^)  =  +  00  obtained  above. 

Theorem  3.3.  Suppose  a'(x)  and  b(x)  are  bounded.  Then  for  any 
2 

u  €  H  we  have 


(3.2.7) 


u-p<WulT.  -  Ch  M  9  2  Ch2 | P | , 


where  C  is  independent  of  u  and  A  but  depends  on  a  and  b  .  If 

H  =  {u  :  u  £  H2  (1  H1  ,  u" (x)  2  a1  |u|  _  or 

tt 

u"(x)  <  -  a  |uj  _  on  (00, r))  where  fco-T)|  =  a„ } 

H  2 

with  ,  a2  >  0  ,  then  any  u  €  H  is  K-perfect  with  K  depending  on 

a^.a^ja,  and  b  .  Thus 

C(H,M,L2)  <  C  . 


Proof.  The  first  part  of  the  result  is  standard  (see  e.g.[  2  ])  and 
the  second  part  follows  from  the  fact  that 


Z(u,Sa,L_)  > 


Ch2  0la2 


^2 


We  consider  now  one  more  choice  for  X  ,  namely  X  =  L 


Then 


the  following  result  is  proved  in  l 3 ] . 

Theorem  3.4.  Suppose  that  a,b  are  sufficiently  smooth.  Then  there 
is  a  constant  C  such  that  for  any  u  €  , 

(3.2.9)  |u-P(SA,VA)u|L  5  C  inf  |u-s| 

CO  s  f  S .  °° 

A 

with  C  independent  of  u  and  A  but  depending  on  an^  maxi~ 

mum  of  the  first  derivatives  of  a  and  b  . 

Thus 

(3.2.10)  C(H1,M,Loo)  5  C  . 


It  is  interesting  to  compare  Theorems  3.1,  3.2,  and  3.4.  The  computationally 

o  X  1 

identical  method  is  quas i-opt imal  on  H  with  respect  to  H  and  L  but 

oo 

is  not  with  respect  to  L ^  . 


3.3.  A  Second  Method  for  Solving  (3.1.4) 


In  this  subsection  we  suppose  b(x)  =  0  .  Here  we  choose 


(3.3.1)  S  =  S^  =  V  =  =  {s(x)  :  s  is  continuous  on  [0,1]  , 

s(x)  is  a  solution  of  (as’)'  =0  on 
each  1^  ,  s(0)  =  s(l)  =  0}  . 

We  again  let  H  =  V  =  ,  X  =  ,  P  =  {(S^.V^)  :  A  any  mesh}  , 

and  a(S^,V^)  =  h(A)  .  We  will  discuss  the  directed  variational  method 

Ol  1  o-i  * 

M  =  (H  ,H  ,H  ,F,B,a)  . 


Exactly  as  in  Theorem  3.1  we  see  that  this  method  is  quasi-opt imal  on  H 
with  respect  to  H*  .  The  methods  differ,  however,  in  regard  to  the 


approximation  properties  of  the  trial  spaces  employed.  The 
approximation  properties  of  the  trial  space  introduced  in  (3.3.1)  is  given 
in  the  following  theorem. 

Theorem  3.5.  There  is  a  constant  C  depending  only  on  (3^  and 
p2  such  that 

(3.3.2)  inf  |u-s|  1  5  ChjLu)  =  Ch|p| 

s  €S.  H  z  L2 

A 

Proof.  See  [4]. 

Combining  this  result  with  the  above  mentioned  quasi-optimality  we 
obtain 

(3.3.3)  |u-P(SA,VA)u|  1  £  Ch|p|L  . 

H  2 

This  estimate  should  be  compared  with  (3.2.6).  They  differ  in  that  while 

(3.3.3)  holds  for  all  a(x)  and  b(x)  satisfying  (3.1.2),  (3.2.6)  holds 
only  for  smooth  a  and  b  .  Thus  we  see  that  the  method  M  considered 
in  this  subsection  is  very  accurate.  Because  of  the  unusual  test  and 
trial  space  it  is,  however,  not  easily  implemented.  The  method  introduced 
in  Subsection  3. A  will  be  as  accurate  as  the  method  discussed  here  while 
being  as  easily  implemented  as  the  standard  finite  element  method  discussed 
in  subsection  3.2. 

3 . A  .  A  Third  Method  for  Solving  (3.1 .  A_)^ 

As  underlined  in  Section  2.2,  the  trial  and  test  spaces  play  very 
different  roles.  The  trial  ; ; ace  is  chosen  for  its  approximation  properties 
and  the  test  space  is  chosen  so  that  the  method  is  optimal  or  nearly  optimal 
so  that  the  method  is  easily  implemented.  The  standard  method,  which 
uses  piecewise  linear  functions  for  both  trial  and  test  spaces  is  optimal 
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and  is  easily  impl  emented .  The  trial  space  has  poor  approximation  properties 
for  problems  with  rough  coefficients,  however.  The  method  discussed  in 
Subsection  3.3  is  optimal  and  has  good  approximation  properties  while  not 
being  easily  implemented  because  of  the  way  the  load  enters  in  the  computation. 
The  method  discussed  in  this  subsection  will  use  the  trial  space  used 

in  the  second  method  and  use  the  test  space  used  in  the  standard 

method.  This  choice  will  simplify  the  implementation,  making  it 
virtually  of  the  same  complexity  as  standard  method,  while  preserving 
the  advantages  of  the  second  method. 

a 

Suppose  as  above  that  b(x)  =  0  .  Let  be  as  defined  in  (3.3.1) 

and  let  be  as  defined  in  (3.2.1).  We  let  H  =  ,  X  =  ,  \f  =  , 

P  =  HSa,V&) }  ,  and  a(S^,V^)  =  h(A),  and  consider  the  directed  variational 


method 


M  =  (H,H,V,F,B,a)  . 


Theorem  3.6.  There  is  a  positive  constant  y  depending  only  on  p^ 
and  P2  such  that 


(3.4.1) 


inf  sup  j  B (S , V)  | 

s€S .  v€V 
A  A 

|s|  1=1  |v|  .=1 

H  H 


■  W  ’  t 


for  all  A 


Proof.  For  s  €  S.  let  s  £  V,  be  defined  by 
A  A 


s(Xj)  =  s(Xj)  ,  j  =  0,1, . . . ,n 


(3.4.2) 


1.  =  ([  a  ^dx)  ^h. 
J  J  l  J 


Then  we  easily  see  that 


as  1 

K'v'iv 

Using  this  relation  we  have 

_  j 

-1  _ 

B(s, s)  =  I 

as  s'  dx 

0 

■  l 

j-i  h3 

P?  f1  2 

<S  ) 

- 1  r2  |s|«i 

and 


Hence 
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Combining  Theorems  3.6  with  Theorems  2.1,  2.3  we  see  that  the  optimality 
constant  C(h\m,H^)  is  bounded  uniformly  over  entire  class  of  coefficients 
satisfying  (3.1.2).  In  addition,  from  Theorem  3.5  we  see  that  we  have  good 
approximation  properties.  Thus 


(3.4.3) 


lu-p(VVul  i  5  chML  5  chfpk 

H  2  2 


with  C  depending  only  on  and  •  Although  estimates  (3.2.6)  and 

(3.4.3)  are  similar,  the  values  of  the  constants  C  in  (3.2.6)  and  C  in 
(3.4.3)  could  be  very  different  if  a(x)  is  not  smooth  (C  «  C)  . 


form 


One  further  point  needs  to  be  considered.  We  are  using  the  bilinear 


B(s,v) 


fl 

as'v'dx  ,  as  in  subsection  3.2,  but  we  are  now  using 

0 


different  trial  and  test  spaces  and  the  trial  space  consists  of  less  simple 
elements.  However  it  is  easily  seen  that  the  stiffness  matrix  is  symmetric 
and  is  as  easily  computed  as  the  stiffness  matrix  for  the  standard  method. 

Theorem  3.7.  Let  ^i’**',<^n-l  t*ie  stant^ar^  basis  for  and 
and  * '  • ’^n-l  be  t'ie  standard  bases  for  §  [cjk(X^)  =  i9  ]  .  Then 


(3.4.4) 


B((Pj  ) 


r  1 
0 


^i 


dx 


A  1  1 

a .  <p .  u>  dx  . 

.1  i  vi 


Proof.  Again  we  use  the  relation 


as 


I . 
J 


=  a .  s 
J 


for  any  s  € 


(3.4.4)  follows  immediately  from  this. 


It  should  be  noted  that  the  stiffness  matrix  for  the  standard  finite 


element  method  discussed  in  Subsection  3.2  is 


(3.4.5) 


,1 

a.  cp'.  <pl 

Jq  3  3  1 


where 


(3.4.6) 


Since  the  load  vector  (the  right  hand  side  in  the  discretized  problem)  is 
the  same  in  the  standard  method  as  in  the  method  discussed  in  this  sub¬ 
section,  we  see  that  the  methods  differ  only  in  the  occurrence  of  a  or 

3 

a^  ,  defined  in  (3.4.2)  and  (3.4.6),  respectively  in  the  stiffness  matrices, 
defined  in  (3.4.4)  and  (3.4.5),  respectively.  Obviously  a.  and  a.  are 
very  close  when  a(x)  is  nearly  constant  but  they  can  be  very  different 
when  a(x)  is  rapidly  changing.  This  shows  that  the  results  obtained  by 
the  standard  method  and  the  third  method  could  be  very  different,  the  third 
method  performing  equally  well  for  problems  with  smooth  coefficients  and 
strikingly  better  for  problems  with  rough  coefficients.  The  analysis  here 
has  been  for  the  norm.  We  remark  that  similar  results  hold  for  the 

norm.  Let  us  also  remark  that  the  third  method  can  be  extended  to  equations 
with  b  £  0  so  as  to  preserve  its  good  properties.  For  further  results 
see  [4]. 


4.  Conclusions  Regarding  the  Selection  of  Finite  Element  Methods 

4.1.  Introduction 

As  we  have  stated  in  the  Introduction  (Section  1)  and  have  shown  in 
the  previous  Sections,  there  are  many  finite  element  methods  which  can  be 
considered  for  any  specific  problem.  In  this  section  we  discuss  the  applica¬ 
tion  of  the  ideas  elaborated  on  above  to  the  rational  selection  or  design  of 
effective  methods. 

4.2.  Definitions  of  the  Various  Types  of  Variational  Methods 

It  is  essential  to  clarify  as  much  as  possible  the  aim  of  an  engineering 
computation,  the  set  of  possible  solutions,  the  environment  in  which  the 
computations  are  to  be  made,  and  the  various  types  of  computational  procedures 
actually  in  use  in  computational  engineering.  Toward  this  end  we  have  intro¬ 
duced  the  notions  of  simple  variational,  variational,  directed  variational, 
and  computational  variational  methods.  Nearly  all  of  the  computational 
procedu  cs  used  in  practice  fall  within  this  framework.  We  mention  here  the 
h-version  and  the  p-version  of  the  finite  element  method,  displacement  and 
mixed  methods,  various  adaptive  approaches,  etc.  The  examples  discussed  in 
Sections  2  and  3  show  that  the  same  computational  procedure  can  be  viewed  as 
different  directed  variational  or  computational  variational  methods,  the  dif¬ 
ference  being  related  to  the  use  of  different  norms  with  which  to  measure  the 
error.  We  have  seen  that  the  different  methods  have  significantly  different 
behavior.  It  appears  that  without  precise  definitions  of  the  various  types  of 
variational  methods,  a  careful  discussion  of  finite  element  methods  leading  to 
rational  choice  of  a  method  for  a  specific  problem  would  not  be  possible. 

4 . 3  Approximability  and  Optima lity 

The  notions  of  approximability  and  optimality  are  two  central  ideas  to 


be  considered  in  the  comparison  of  finite  element  methods.  Clearly  the 
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trial  space  should  be  tailored  as  well  as  possible  to  the  class  of  possible 
solutions.  The  approximability  constant  Z(u,S,X)  is  a  measure  of  how  well 
this  has  been  done.  The  trial  spaces  introduced  in  Subsections  3.3  and  3.4 
are  obviously  preferable  to  the  standard  polynomial  spaces. 

The  test  space  should  be  selected  so  as  to  lead  to  a  small  optimality 
constant  C(u,M,X)  and  so  as  to  lead  to  computational  simplicity.  The 
optimality  constant  is  a  measure  of  how  well  the  approximate  solution  performs 
in  comparison  with  the  best  possible  approximation.  In  Example  2.2.2  with 
X  =  we  have  seen  that  for  the  solution  u  ,  C(u,Mn,X2)  n,  and 

Z(u,S  ,  X„)  —  .  Thus  the  error  satisfies  I  u-u  I  n<  l//n  ,  i.e.,  the 

n  2  n  1  n  X^ 

good  approximability  properties  of  are  partially  eroded  because  of  the 

large  optimality  constant.  We  have  also  seen  that  u  leads  to  the  largest 
optimality  constant.  The  method  does,  however,  converge. 

The  second  and  third  methods  treated  in  Section  3  both  have  small 
optimality  constants,  but  the  third  method  is  much  easier  to  implement  than 
the  second. 

The  selection  of  the  test  spaces  heavily  influences  the  set  of  solutions 
which  arc  perfect,  i.e.,  the  set  of  solutions  for  which  the  method  works  well, 
assuming  good  approximability.  Sometimes  all  "reasonable"  solutions  are 
perfect,  while  the  imperfect  solutions  are  "wild".  Example  2.3.3  is  an 
illustration  of  this.  In  other  problems  reasonable  solutions  could  be  very 
imperfect.  Example  2.2.1  is  an  illustration  of  this.  That  example  also 
clearly  shows  the  effect  of  the  choice  of  the  test  space  on  the  class  of 
perfect  solutions.  In  any  case,  the  method  which  is  perfect  for  the  largest 
class  of  solutions  is  preferable. 

If  a  large  class  of  solutions  are  imperfect  and  the  optimality  constant 
-*•  +“>  as  an  n  -*■  +*  (n  being  the  number  of  degrees  of  freedom,  say),  then 
a  method  with  a  smaller  C  is  preferable  to  one  with  a  large  C  .  The 
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standard  finite  element  method  (cf.  the  discussion  in  Subsection  3.2  in  the 
constant  coefficient  case)  and  the  method  discussed  in  Example  2.2.2,  when 
considered  in  connection  with  X  =  ,  can  be  compared  in  this  way.  In  the 

°1  7- 

first  case  we  get  C(H  =  00  while  in  the  second  case  'v  /n 

In  many  situations,  rigorous  estimates  of  the  optimality  constant  are 
not  available  and  judgments  concerning  choice  of  methods  must  be  based  on 
computational  experience.  In  these  situations,  one  must  attempt  to  gain 
insight  on  the  class  of  perfect  solutions  from  the  computational  experience. 
Nevertheless,  we  have  to  be  aware  that  there  is  a  possibility  that  only 
perfect  solutions  will  be  tested  and  the  conclusions  could  be  misleading. 

Approximability  and  optimality  together  influence  the  performance  of 
the  method.  A  method  could  have  a  deteriorating  optimality  constant  and 
still  give  reasonably  good  accuracy  if  the  approximability  is  sufficiently 
good  to  offset  the  lack  of  optimality.  Thus  we  are  interested  in  the  set  of 
solutions  for  which  the  method  converges  as  well  as  the  set  of  perfect 
solutions . 

Examples  2.2.1  and  2.3.3  clearly  illustrate  the  influence  of  the  choice 
of  trial  and  test  space  on  optimality.  If  the  "major  part"  of  the  bilinear 
form  B  is  symmetric  (cf.  Example  2.2.1  for  | A | > 1) ,  then  using  the  same 
space  for  the  trial  and  test  space  is  often  advisable.  If  the  "major  part" 
of  B  is  nonsvmmetric  (cf.  Example  2.2.1  for  j  A | < 1  and  Example  2.3.3), 
then  it  is  often  useful  to  consider  different  trial  and  test  spaces. 

A  .  A  .  St ab i_l  i  tv  and  t he  S t  ab  i  1  i  tv  Cond  i  t  ion 

The  stability  constant  should  be  viewed  as  a  tool  to  be  a«ed  in  the 
analysis  of  optimality.  C(u,M,X)  and  D(u,M,X)  ,  and  also  C(ll,M,X)  and 
D(H,M,X)  ,  are  closely  related,  as  shown  in  Theorem  2.1.  The  constant  y(S,V) 
is  in  turn  used  to  estimate  the  stability  constant,  as  shown  in  Theorem  2.3. 


We  note  that  y(S,V)  does  not  depend  on  the  exact  solution  u  and  thus  that 
an  analysis  of  stability  or  optimality  based  on  an  estimate  for  y(S,V)  must 
concentrate  on  "worst"  possible  cases.  The  stability  condition  (2.3.17)  is 
necessary  in  the  sense  that  if  it  is  violated,  then  the  method  must  diverge 
for  at  least  one  solution,  as  shown  by  Theorems  2 . 2  and  2,5.  Nevertheless, 
a  method  can  violate  this  condition  and  still  perform  well  for  a  large  class 
of  solutions. 

Recently  there  have  been  statements  in  the  literature  to  the  effect 
that  certain  methods  perform  well  computationally  even  though  the  stability 
condition  (also  sometimes  called  the  inf-sup,  LBB,  or  BB  conditions)  is 
violated.  This  occurrence  can  be  explained  by  noting  that  in  the  computations 
only  perfect  solutions  were  considered.  These  particular  solutions  may  have 
been  considered  partly  on  the  basis  of  physical  insight,  but  partly  by 
accident.  Thus  we  see  that  a  detailed  analysis  of  the  structure  of  the  set 
of  perfect  solutions  is  highly  desirable.  Certainly  formal  insistance  that 
the  stability  condition  is  satisfied  is  inappropriate. 

We  see  from  Theorem  2.1,  that  the  method  discussed  {n  Example  2.2.2, 
when  considered  in  connection  with  the  L^-norm  (case  (b)),  does  not  satisfy 
the  stability  condition,  since  the  optimality  constant  as  n^  .  Still, 

on  the  basis  of  experimental  evidence,  it  is  likely  that  the  method  would 
perform  well  because  only  "reasonable"  solutions  would  be  considered.  A 
similar  situation  occurs  when  we  consider  the  performance  of  the  standard 
finite  element  method  in  the  I^-norm  or  when  considering  the  second  choice 
of  spaces  in  Example  2.3.3.  In  contrast  it  is  very  likely  that  experimentally 
one  would  generally  see  that  the  method  discussed  in  the  Example  2.2.1  has  a 
serious  deficiency  when  |  A  |  ■< 3  . 
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